CHAPTER I


INTRODUCTION: THE FIELD STUDY 


THE GENERAL GOAL of this research project
has been to study differences in the effects upon
elementary, junior high school, and high school pupils
of two grouping systems: an ability grouping system
that adjusted the rate of presentation of curricular
materials, and a random grouping system that
differentiated the curriculum principally
through the use of enrichment. The study
explores a wide range of dependent variables that 
might be affected differentially by the two grouping 
treatments, including many not previously explored 
in ability grouping research. 


Scope of the Project 


The paper has been divided into three sections. 
The first section contains a brief description of the 
research procedures including the samples studied, 
the measures used, and the ability grouping and 
random grouping treatments used by the two partic- 
ipating districts. The second section is concerned 
with the achievement-related dependent variables in 
the two grouping treatments, including general
achievement, over- and underachievers, and study
methods. The third section discusses what we might 
call non-cognitive dependent variables that could be 
influenced by the grouping treatments such as the 
pupil’s social adjustment, attitudes toward school, 
problems, self-concept, and personality. 

Perhaps the most important variable to be con- 
sidered in studies involving instructional methods 
is achievement. This research has explored the 
question of achievement differences in ability
grouped and random grouped classrooms more 
thoroughly than any work previously published, con- 
sidering a number of achievement areas over a four- 
year period, and measuring achievement differ- 
ences at all grade levels from fourth through 
twelfth. 

A careful study was also made to determine
whether study habits and methods differed under the 
two grouping treatments. This research not only
explored the claim often made by educators that
ability grouping leads to better study methods among 
bright pupils, but also analyzed differences and 
changes in study habits for average and slow pupils 
in the two systems. 

The possible effects of ability grouping upon social
adjustment and pupil peer status have also been a
cause for concern among educators. This research
has attempted to answer some of the questions in
this area by studying differences in sociometric status
between comparable pupils in ability grouped
and random grouped classrooms.

Some educators have condemned ability grouping 
on the grounds that it is destructive to the individual’s
self-concept. It has been suggested that in the
ability grouped classroom bright pupils become arrogant,
while slow pupils become insecure and defeated.
The slow pupil’s low opinion of himself, it
has been suggested, leads him to adjust his goals
and efforts downward to fit the label placed on him in
the ability grouping situation. This research examines
in considerable detail these and other questions
relative to self-concept differences in ability
grouped and random grouped classrooms.

Somewhat related to self-concept are a number 
of personality areas that may also be affected by the 
ability grouping situation. Personality measures, 
both of the group administered inventory type and 
the individually administered projective type, were 
employed in this research to explore the possible 
effects of ability grouping upon certain personality 
variables. 

The attitudes of pupils toward the school environ- 
ment and their adjustments to it were also explored 
in this research. This area has received little attention
in previous research but appears to be an
important consideration in comparing the ability 
grouping and random grouping treatments. 


Research Procedures 


Two adjacent and closely comparable school districts
in Utah provided the setting in which the above
questions were explored. One of these districts 
employed random grouping with enrichment, and the 
other had adopted a system of ability grouping with 
acceleration, coincident with the start of this re- 
search. Over 2,500 pupils from the fourth, sixth, 
seventh, eighth, and ninth grade levels were selec- 
ted in the two districts at the beginning of the study. 
During the second year of the study, this sample 
was increased to about 4,000 pupils. Research data
were collected over a four-year period in order
to appraise the long term effects of the two group- 
ing treatments. Pupils who were first tested early 
in grade four were followed through grade seven. 
As all samples were similarly followed over the 
period of the study, data were collected at all 
grades from four through twelve.

The district that employed ability grouping, re- 
ferred to throughout this report as District A, used 
a composite achievement test score as a basis for 
grouping elementary school pupils into self-contained
classrooms. At the
secondary level, ability grouping was employed in 
mathematics and science classes only. Pupils were 
grouped into three ability levels referred to in this 
report as superior, average, and slow, in both the
elementary and secondary schools. 

The other district participating in this research, 
known as District R, employed random or heteroge- 
neous grouping and differentiated the curriculum for 
individual differences in the random grouped class- 
room through the use of enrichment. The District 
A and District R curricula were very similar in 
most subject areas and at most grade levels, and in 
many instances the materials employed in District 
A to adjust the rate of presentation of the curricu- 
lum were also used in District R to provide enrich- 
ment in heterogeneous classes. This project called 
for comparisons between pupils in the two grouping 
treatments who were of comparable ability level. 
Pupils in District A who had been assigned to the 
three types of ability groups were measured by the 
investigators and then, after administering the 
same measure to District R pupils, the latter were 
classified into three ability levels so as to provide 
samples of pupils of comparable level in the two districts.

Except during the pilot study year, the measures 
employed in this study were not part of the regular 
testing program of either district and were administered
by teams of examiners trained and supervised
by the principal investigator. These examiners
were employed by Utah State University and
were not affiliated with either school district. 


Analysis 


Data collected in this research were analyzed 
using several procedures. In cases where it was 
desirable to measure changes that had taken place 
in pupils who had been exposed to the two grouping 
treatments, analysis of covariance was used. For 
example, this technique was employed in studying 
the achievement data. In this case, the final achievement
scores of the various groups studied were adjusted
to compensate for differences present at the
beginning of the period covered by the analysis. 
Analysis of variance was employed to compare sub- 
jects on most of the other dependent variables. Var- 
iables that were analyzed using this technique in- 
cluded those for which it appeared reasonable to as- 
sume that the sampling techniques would lead to in- 
itially comparable groups, and those for which an 
initial measure could not be obtained. 
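
The adjustment of final scores for initial differences can be illustrated with the usual textbook formula for covariance-adjusted means. The sketch below is only an illustration under stated assumptions: the function name, the group labels, and all scores are hypothetical, and the report does not say that its computations took exactly this form. Each group's final mean is shifted by the pooled within-group slope times the distance of its initial mean from the grand initial mean.

```python
from statistics import mean


def adjusted_means(groups):
    """Return covariance-adjusted final means.

    groups: dict mapping a group name to a list of (initial, final) pairs.
    """
    # Pooled within-group regression slope of final on initial scores.
    sxy = 0.0
    sxx = 0.0
    for pairs in groups.values():
        mx = mean(x for x, _ in pairs)
        my = mean(y for _, y in pairs)
        sxy += sum((x - mx) * (y - my) for x, y in pairs)
        sxx += sum((x - mx) ** 2 for x, _ in pairs)
    slope = sxy / sxx

    # Grand mean of the initial scores over all pupils.
    grand_x = mean(x for pairs in groups.values() for x, _ in pairs)

    # Shift each group's final mean to where it would be expected to fall
    # if the group had started at the grand initial mean.
    return {
        name: mean(y for _, y in pairs)
        - slope * (mean(x for x, _ in pairs) - grand_x)
        for name, pairs in groups.items()
    }


# Hypothetical (initial, final) grade placement scores, not study data.
scores = {
    "District A": [(4.0, 5.0), (5.0, 6.0), (6.0, 7.0)],
    "District R": [(5.0, 6.5), (6.0, 7.5), (7.0, 8.5)],
}
adjusted = adjusted_means(scores)
```

In this made-up example the raw final means differ by 1.5 grades, but much of that gap reflects the higher starting point of the second group, so the adjusted means differ by less.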

For other portions of the research data, chi-square
analysis was employed. This technique was
generally used in cases where the data fell into a
number of well-defined categories or in cases where
parametric statistics did not appear appropriate.
Much of the sociometric data were analyzed using 
chi-square contingency tables because this permit- 
ted use of the standard sociometric classifications 
(stars, isolates, etc.) that are generally understood 
by persons familiar with this area of measurement. 
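
As an illustration of the contingency-table approach, the following minimal sketch computes the Pearson chi-square statistic and its degrees of freedom for a table of counts. The function name, the column labels, and the counts are hypothetical and are not taken from the study.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a
    contingency table given as a list of rows of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if row and column classifications are independent.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Hypothetical counts: rows are the two grouping treatments, columns are
# sociometric classes (stars, others, isolates). Not study data.
counts = [
    [10, 30, 10],
    [20, 20, 10],
]
stat, dof = chi_square(counts)
```

The statistic is then referred to a chi-square table at the stated degrees of freedom to judge significance.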

Three major types of differences were considered
in analyzing most of the data. The first and
most important comparisons were between comparable
pupils in the two grouping treatments. These
comparisons involved pupils of the same sex and 
ability level in the ability grouped and random
grouped samples and are referred to throughout the 
study as between-treatment differences. These dif- 
ferences provide the best evidence on the differen- 
tial effects of ability grouping and random grouping 
upon the dependent variables. 

Comparisons between pupils of the same sex and 
in the same grouping treatment who differed in abil- 
ity level are referred to as level or within-treat- 
ment differences. These comparisons tell us wheth- 
er the grouping treatments had a differential effect 
upon pupils at different ability levels and give us additional
insight about the treatments that helps to
interpret the between-treatment differences. Boys
and girls who were in the same grouping treatment 
and at the same ability level were compared to pro- 
vide knowledge about the sex-related differences on 
the dependent variables. In some instances, these 
sex differences also provided additional insights
into the effects of the grouping treatment.
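
The three types of comparison can be made concrete by enumerating the cells of the design. The sketch below assumes a full crossing of two treatments, three ability levels, and two sexes; the labels and helper name are illustrative only. It lists every pair of cells that differs on exactly one factor.

```python
from itertools import combinations, product

TREATMENTS = ("ability grouped", "random grouped")
LEVELS = ("superior", "average", "slow")
SEXES = ("boys", "girls")

# Every cell of the design: (treatment, level, sex).
cells = list(product(TREATMENTS, LEVELS, SEXES))


def differs_only_in(a, b, index):
    """True if cells a and b differ on factor `index` and on no other."""
    return all((x != y) == (i == index) for i, (x, y) in enumerate(zip(a, b)))


pairs = list(combinations(cells, 2))
# Same sex and level, different treatment.
between_treatment = [p for p in pairs if differs_only_in(*p, 0)]
# Same sex and treatment, different ability level.
level = [p for p in pairs if differs_only_in(*p, 1)]
# Same treatment and level, different sex.
sex = [p for p in pairs if differs_only_in(*p, 2)]
```

Under these assumptions there are 6 between-treatment comparisons, 12 level comparisons, and 6 sex comparisons for each dependent variable within a sample.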


THE SAMPLE 


As is the case with most field studies in educa- 
tion, random samples of broadly defined populations 
could not be obtained in the research herein de- 
scribed. Even if such a random sample had been 
available, the problem of randomly dividing such a
sample and exposing half of them to a school situa- 
tion involving ability grouping and the other half to 
random grouping would have been nearly impossible. 
Random samples of more narrowly defined populations
such as “all pupils enrolled in regular fourth
grade classes in school district X” can be obtained
in field studies, but even this approach often causes 
administrative problems that public school officials 
consider too great a disruption of the school’s ac- 
tivities. Thus, the educational field study must 
usually make compromises in sample selection that 
would be unacceptable in the more rigorous setting 
of the psychological laboratory. 

Yet, the field study has certain advantages that 
make it a useful approach in educational research. 
First, although the discipline of the laboratory is 
sacrificed, the artificiality that usually attends it is 
greatly reduced in the field study. Thus, the field 
study gathers data in realistic and reasonably typical
school situations and its findings are often directly
applicable to other school situations. In contrast,
the laboratory school, which employs highly
selected teachers to work with small classes un- 
hampered by financial limitations and free to draw 
professional support from a university faculty, of- 
ten produces research results that have little carry- 
over to the usual school in which classes are large, 
teachers are average, and money is scarce. Sec- 
ond, the field study can work with larger numbers 
of pupils than are usually manageable in laboratory 
studies. This permits identifying and studying sub- 
groups that would be too small to consider with the 
usual laboratory sample. More important, these
large samples offer a greater possibility that variables
one cannot control, and, for that matter, variables
that we are not even aware of, will “randomize
out” or tend to occur equally in the groups we
wish to compare.

The researcher doing a field study, though us- 
ually unable to work with truly random samples, is 
still obligated to make his samples as random as 
the situation permits and to be alert to biasing ele- 
ments in his sampling procedure that may distort 
his findings. His second obligation is to describe 
with reasonable thoroughness the situation in which 
the research was carried out. Unless this is done, 
the reader has little upon which to base his evaluation
of the project and his estimate of the pertinence
of the findings to other pupils in other schools. The 
nature of the schools involved, the comparability of 
their curricula, the numbers of pupils in the sample
and how they were selected are among the topics to 
be discussed in this section. 


District Comparisons 


District R, from which the random grouped sam- 
ple was selected, is a city school district having an 
average daily attendance (ADA) of about 17,000 at the time
of the study. District A, from which the ability 
grouped sample was selected, is the surrounding 
county school district with an ADA of about 12,000.
Similarities and differences between these districts 
must be considered in interpreting the results of 
this research. It is impossible, of course, to com- 
pare the two districts on all aspects of the educa- 
tional environment they provide and the population 
they serve. Certain critical comparisons, however,
have been made and are reported throughout
this section. 

In terms of financial resources for instruction, 
the two districts were closely comparable during 
the four years of this study. The yearly average 
amount available for instruction was $238.76 per
pupil in District A and $230.48 in District R. When
it is considered that District A found it necessary 
to purchase a number of materials in order to fit
their curriculum to the ability grouping system, it 
would seem appropriate to conclude that financial
resources available for instruction were closely 
comparable. 

Perhaps the most important element to consider 
is the relative skill of teachers in the ability grouped
and random grouped classrooms from which the re- 
search samples were selected. Unfortunately, 
a valid measurement of teaching ability is extremely 
difficult if not impossible at our present state of 
knowledge. Thus, our comparisons between teach- 
ers must rely on variables which, though indirect, 
are probably valid indicators of the skill of a given 
group of teachers. 

Salaries paid to participating teachers in the two 
districts were very similar with the first year mean 
for District A being $4744 as compared with a District
R mean of $4788. Teachers in the two districts
were also closely comparable in terms of certifica- 
tion, but in terms of teaching experience a signifi- 
cant difference existed with District R teachers hav- 
ing on the average about four and a half years more 
experience than District A teachers. This differ- 
ence existed at both secondary and elementary 
school levels. All in all, however, teachers in the 
two districts were reasonably comparable with nei- 
ther district showing a consistent superiority on the 
variables compared. 

Another important factor that must be considered 
in comparing Districts A and R is the curriculum. 
Major differences in the basic curricula of the two 
districts would make achievement comparisons 
meaningless because any achievement measure used 
would then be more appropriate for one district than 
the other. A study in which the same curriculum 
was adjusted by the two different treatments — abil- 
ity grouping with acceleration and random grouping 
with enrichment — would provide an ideal situation 
for comparing the treatments. The basic curricula 
in Districts A and R, though not identical, were very 
similar and provided a sound basis for comparing 
the two teaching approaches with which this research
is concerned. In the State of Utah, most school sub- 
jects are covered by curriculum guides that have 
been developed cooperatively by the Utah State Department
of Public Instruction and the school districts
of the state. These guides are published by
the state and were followed by both of the participating
school districts, not only in the subject areas
that were measured in this research, but also in 
most other subjects taught in the elementary and secondary
schools. Both districts also selected their
textbooks from the state approved textbook list, which
is made up of comparable textbooks meeting the objectives
of the state curriculum guides. In some
subject areas, the two districts used the same textbooks.
Thus, the curricula offered by the two districts
were very similar in terms of major objectives
and sequences. In many cases, the basic District
A curriculum, which was geared to pupils in the
average ability group, was found to be almost identical
in rate, sequence, and content to the District
R basic curriculum for heterogeneous groups. Differences
did occur, however, between the District
A and R curricula for superior and slow pupils,
largely because of the considerable adjustments in 
rate of instruction made in District A. For exam- 
ple, nearly all District A superior pupils were 
scheduled to take Algebra 1 in the eighth grade,
while District R pupils did not take Algebra 1 until 
the ninth grade. As enrichment materials were 
available in District R classrooms, one would expect
that the District R superior pupils, while moving
at a slower pace than their counterparts in District
A, would have more time for independent study
and projects and might develop greater depth in the 
subjects covered. 


Selecting Research Subjects 


During the pilot study year of the project, sub- 
jects were selected from the fourth, sixth, seventh, 
eighth, and ninth grades. Table 1 gives the number
of pupils in each of these five samples during the 
four years of the project. These data are based on 
numbers of pupils completing each achievement test
battery. 

Because a fairly large number of variables were
explored in this research, the numbers of pupils for
whom complete data are available change from one
variable to another. For example, a pupil may have
completed all tests relating to personality or self-concept
but may have missed and been unable to
make up one of the achievement tests. He may,
therefore, be included in the self-concept and personality
analysis but not in parts of the achievement
analysis. The number of students available for the
achievement analysis was generally smallest be- 
cause the achievement batteries were longer, were 
usually administered over a period of four days at 
the elementary level and two days at the secondary 
level and were thus more difficult to make up. In 
general, however, this table gives the reader a rea- 
sonable picture of the numbers of pupils involved in 
the Utah study. 

Initial samples were drawn in classroom units 
rather than by individuals. This procedure, of 
course, is less satisfactory than random selection 
of individual cases. 

Pupils who were in the fourth grade at
the start of the project (1958-59) are referred to
as Sample IV throughout this report. This fourth- 
grade sample in District A came from nine class- 
rooms in three elementary schools. Three superior 
sections, three average sections and three slow 
sections were included, one of each from each 
school. The District R sample came from 13 classrooms
in six schools. These schools were selected
for comparability with the District A schools in 
terms of socioeconomic status of the neighborhood
served. Classrooms were selected at random from 
among the available classrooms in each school. As 
District R pupils were randomly grouped, a larger 
sample was considered necessary in order to insure 
getting enough subjects at each ability level. Prior 
to the beginning of the second year of the study, sup- 
port was obtained from the Cooperative Research 
Program, U.S. Office of Education, to carry out
the remaining three years of the project. At this 
time the number of pupils in the fourth grade sam- 
ples was increased. 

Pupils selected from the sixth grade during the 
initial year of the study have been designated as
Sample VI. The District A Sample VI was taken
from 12 classrooms in four schools and included 
four superior sections, four average sections, and 
four slow sections. The sixth grade sample in District
R was taken from 16 classrooms in eight
schools. The procedure for selecting sixth grade 
classrooms was the same as for fourth grade class- 
rooms. 


Seventh grade pupils selected during the initial 
year of the study are designated as Sample VII. Dur- 
ing the pilot study year, the District A sample con- 
sisted of all pupils in three seventh grade mathe- 
matics classes, one each being randomly selected 
from the superior, average and slow classes. The 
Sample VII subjects for District R during the first 
year were taken from seven randomly selected sev- 
enth grade mathematics classes. These samples 
were substantially enlarged during the second year 
of the study. 


Eighth grade pupils selected during the initial 
year are designated as Sample VIII. These pupils, 
as was the case with Sample VII, were taken from 
the junior high schools that ordinarily serve the 
schools from which the elementary samples were 
drawn. District A Sample VIII consisted of all pup- 
ils in six eighth grade mathematics classes; two 
randomly selected from each ability level. District 
R Sample VIII consisted of all pupils in seven randomly
selected eighth grade mathematics classes.
Both samples were enlarged during the second year 
of the study. 


Pupils in the ninth grade during the initial year 
of the study are designated as Sample IX. These 
pupils were studied from the beginning of the ninth 
grade through the end of the twelfth grade. In Dis- 
trict A, this sample included all pupils in three 
ninth grade algebra classes, and three ninth grade 
science classes, one in each subject at slow, aver- 
age, and superior levels. Because many pupils at 
the superior and slow levels were in both the math- 
ematics and science classes selected, this sample 
was quite small. In District R, Sample IX included 
all pupils taken from four randomly selected ninth 
grade mathematics classes. 


Comparability of Research Subjects


Although considerable effort was made to avoid 
techniques that would lead to biasing of the samples, 
the techniques used were far from ideal. Thus, 
during the course of the study, and especially in the
pilot study year, data were collected to check our 
success in selecting reasonably comparable subjects.
Some of these data will now be presented.


Socioeconomic Class: It will be recalled that ele- 
mentary schools were selected at the outset of the 
study that were considered to serve comparable 
socioeconomic groups in the two districts. A pre- 
liminary check of socioeconomic status made in 
these schools indicated no significant differences. 
It was considered desirable, however, to make a 
more thorough check of socioeconomic differences 
as this variable could have a considerable effect up- 
on the outcome of the research. The Hollingshead 
Occupational Scale was selected as a basis for estab- 
lishing five socioeconomic class levels. This scale 
is based entirely upon the occupation of the father 
or the principal family wage earner. Although much 
easier to obtain than some of the more elaborate 
socioeconomic class measures such as Warner’s
Index of Status Characteristics, the Hollingshead 
Occupational Scale has been found to correlate highly
with such measures (61). These data were collected
during the final year of the study. At this
time, Sample IV and Sample VI pupils were in the 
seventh and ninth grades, respectively. At this level
they were able to provide more information on their
fathers’ occupations than they could have at the elementary
school level. The distribution of cases into
the five Hollingshead socioeconomic class levels
was compared using chi square contingency tables 
to determine whether pupils of comparable ability 
levels in the two districts were significantly differ- 
ent in their distribution into the five social class 
levels. Of the six chi square values calculated for 
Samples IV and VI, one was statistically significant. 
The significant chi square was found between District
A and District R superior pupils in Sample
VI. This chi square reached significance at the .01
level and indicated a tendency for District R supe- 
rior pupils to be of higher socioeconomic status. 
None of the other chi square values reached signifi- 
cance at the .05 level and most were small, indi- 
cating no true differences in socioeconomic distri- 
bution between the two samples. The significant 
difference, however, between superior Sample VI 
pupils in Districts A and R should be taken into con- 
sideration in interpreting differences found between 
these groups on the other variables considered in 
this research. 


Intelligence Differences: Comparisons of mean
intelligence levels for District A and District R
show that most of the parallel groups in the two districts
were closely comparable in mean mental age
expressed in grade placement equivalents. For the
language score on the California Short-Form Test
of Mental Maturity, the average difference between
comparable ability groups in Districts A and R was 
less than two-tenths of a grade. 

The differences between comparable District 
A and District R pupils were somewhat larger on the
CMM non-language score, averaging nearly three- 
tenths of a grade. The greatest difference was 
found between the slow pupils in Sample IX and fav- 
ored the District R pupils. Even this largest differ- 
ence represents less than two raw score points 
(28.95 vs. 27.25) and falls far short of statistical 
significance. 


THE TREATMENT 


Because terms such as “ability grouping,” “enrichment,”
and “heterogeneous grouping” have different
meanings in different situations, it is necessary
to define operationally their meaning in a research
project so that the reader fully understands
the specific circumstances under which the data
were collected.


The Ability Grouping System 


The ability grouping system used in District A
during the course of this research employed a standard
achievement battery, the California Achievement
Test Battery (CAT), as a basis for grouping
pupils into three ability levels. An achievement
measure was selected instead of scholastic aptitude 
or intelligence measures largely because the district 
administration found that grouping on the basis of 
past achievement was more acceptable to parents 
than grouping on the basis of ability. Scores on a
battery of achievement tests have generally been 
found to correlate rather highly with measures of 
scholastic aptitude, so it is doubtful if any marked 
difference in the grouping would grow out of a change
from one of these measures to another. 

At the elementary school level, the total grade 
placement score on the California Achievement Test 
Battery, Form W, was used as a basis for grouping. 
Pupils were administered the test battery in the 
spring of each school year. Pupils at a given grade 
level were then divided into boys and girls and a separate
list made up for each sex starting with the pupil
having the highest grade placement score and
proceeding in order to the pupil with the lowest score. 
The lowest ten boys and lowest ten girls were then 
selected and were placed in the slow section. Next, 
the highest 15 boys and highest 15 girls were select- 
ed and were placed in the superior section. The re- 
maining pupils were placed in two or three average 
sections by placing every second or third pupil in a
given section. Thus, the average sections within a
given school were comparable in terms of ability 
level and range rather than being divided into “high
average” and “low average” sections. Both districts
had a separate program of special classes for
mentally retarded pupils, so these were not includ- 
ed in either the ability grouped or random grouped 
classes. A number of special provisions were 
made for unusual situations such as an excessive
number of superior or slow pupils in a given school, 
too large an ability gap between boys and girls as- 
signed to a slow section, and others that seemed to 
call for special treatment. This simple three-level 
grouping system based on a composite achievement
score has both advantages and disadvantages.
Among its advantages are that it may be carried out 
in relatively small schools, it is easy to use and
generally acceptable to parents, and it permits a 
relatively easy transition for pupils transferring to
or from schools employing random grouping. Fi- 
nally, this system permits the use of self-contained 
classrooms at the elementary level. When specific 
grouping criteria are established on a subject-by-subject
basis, some sort of departmentalized program
is usually called for, as pupils will sometimes
fall into different ability groups for different subject
areas.
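
The elementary-level assignment procedure described above (a ranked list for each sex, the lowest ten and highest fifteen of each sex set aside, and the remainder dealt in rotation into the average sections) can be sketched as follows. This is a minimal sketch only: the function name, parameter names, and pupil data are hypothetical, and the special provisions the district made for unusual situations are not modeled.

```python
def assign_sections(pupils, n_average_sections=2, n_slow=10, n_superior=15):
    """Assign pupils to slow, superior, and average sections.

    pupils: list of (name, sex, score) tuples, with sex "boy" or "girl".
    Returns a dict mapping section name to a list of pupil names.
    """
    sections = {"slow": [], "superior": []}
    average_names = [f"average {k + 1}" for k in range(n_average_sections)]
    for name in average_names:
        sections[name] = []

    for sex in ("boy", "girl"):
        # Separate list for each sex, highest score first.
        ranked = sorted(
            (p for p in pupils if p[1] == sex),
            key=lambda p: p[2],
            reverse=True,
        )
        if len(ranked) < n_slow + n_superior:
            raise ValueError(f"too few {sex}s to fill slow and superior sections")
        cut = len(ranked) - n_slow
        # Lowest-scoring pupils go to the slow section, highest to superior.
        sections["slow"] += [p[0] for p in ranked[cut:]]
        sections["superior"] += [p[0] for p in ranked[:n_superior]]
        # Deal the remaining pupils into the average sections in rotation,
        # so each average section spans the same ability range.
        middle = ranked[n_superior:cut]
        for k, name in enumerate(average_names):
            sections[name] += [p[0] for p in middle[k::n_average_sections]]
    return sections


# Hypothetical grade of 40 boys and 40 girls with made-up scores.
pupils = [(f"boy{i}", "boy", 3.0 + 0.1 * i) for i in range(40)]
pupils += [(f"girl{i}", "girl", 3.0 + 0.1 * i) for i in range(40)]
sections = assign_sections(pupils)
```

Dealing the middle of the ranked list in rotation, rather than splitting it at a cut score, is what keeps the average sections comparable in level and range.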

The principal disadvantage of the grouping sys- 
tem employed in District A at the elementary level 
is that the use of a general grouping criterion such 
as a composite achievement score results in only
a moderate reduction in heterogeneity. It does not 
produce pupil groups that are at or even near the 
same level on all subjects included in the curricu- 
lum. Thus District A employed what might be re- 
ferred to as a conservative grouping system that 
can be adapted to most elementary schools, but 
brings about only moderate reductions in heteroge- 
neity. 

At the secondary school level, District A em- 
ployed ability grouping in mathematics and science 
classes only. Even within these two subject areas, 
the classes that could be satisfactorily grouped be- 
came fewer as the pupil moved into the higher grade 
levels. The grouping criterion for mathematics was
the grade placement score of the Mathematics sub- 
test of the California Achievement Test Battery. 
The achievement score obtained in the spring on this
subtest formed the basis for grouping pupils into dif- 
ferent ability levels the following fall. In science, 
on the other hand, since the CAT has no science sub- 
test, the composite grade placement score for the 
Mathematics and Language subtests was used, thus 
providing a general criterion for grouping in this 
subject. 

Ability grouping was employed in all mathematics
and science classes during the junior high
school grades. At the high school level, ability
grouping was possible only in the lower level mathematics
and science courses that are taken by a sizable
percentage of pupils. In such elective courses
as physics and chemistry, the maximum amount of
grouping possible was at two levels since virtually
no slow pupils took these elective physical science
courses.


The Enrichment System 


In District R, a program of heterogeneous grouping
plus enrichment was employed at all grade levels.
At the elementary school level, pupils were
randomly assigned to the various sections within a
given grade level. All principals in District R were
briefed on the research project by the project direc- 
tor during the pilot study year of the project and the 
superintendent set up a specific policy for placing 
pupils in sections that required pupils to be random- 
ly assigned. At the secondary level, the same pol- 
icy was in effect. 


Reduction in Heterogeneity 


Because ability grouping systems attempt to make
adaptation for individual differences easier by reduction
of heterogeneity within the classroom, it is
important in the appraisal of such a system to determine
just how much of a reduction in heterogeneity
is achieved. The amount that heterogeneity can
be reduced by ability grouping is dependent upon 
several variables. Among these are the specificity 
and validity of the grouping criterion, the number of 
grouping levels to be established and the variability 
and numbers of pupils to be grouped. If we have 
large numbers of heterogeneous pupils, employ mul- 
tilevel grouping and have a highly valid criterion, we 
can achieve great reductions in heterogeneity in a 
specific subject area. 

A criticism often leveled at ability grouping pro- 
grams, particularly those employing general crite- 
ria such as an intelligence test score or a composite 
achievement test score, has been that such a pro- 
gram reduces the heterogeneity of pupils in specific 
subjects to such a slight degree that the teacher, in 
effect, has just as heterogeneous a class in specific 
subject areas as would be found if no grouping had 
taken place. 

During the pilot study year, our samples were 
studied carefully to determine the reduction in het- 
erogeneity achieved by the grouping system used in 
District A. In order to determine the variability of 
those classrooms included in the research from Dis- 
trict A and District R, means and standard devia- 
tions were computed for each elementary school 
classroom on the California Mental Maturity Short 
Form Language Score, the CAT Arithmetic total, 
CAT Language total, and the CAT Reading total. The 
average variability in District A on the various sub- 
tests was found to be 26 percent less than in District 
R for the elementary sample. The smallest differ- 
ence in variability was found between fourth grade 
pupils in arithmetic where the average standard de- 
viation was only about ten percent smaller for Dis- 
trict A. The greatest difference was found in the 
reading scores of sixth grade pupils where the 
standard deviation was nearly 45 percent smaller 
in District A than in District R. 
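
The comparison just described can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the procedure actually used in the project: the scores, the class sizes, and the function names are hypothetical, and the use of the sample standard deviation (n - 1 in the denominator) is an assumption, since the text does not say which form was computed. 

```python
from statistics import mean, stdev


def average_class_sd(classrooms):
    """Return the mean of the within-classroom standard deviations."""
    return mean(stdev(scores) for scores in classrooms)


def percent_reduction(grouped, random_assigned):
    """Percent by which the average class SD of the grouped classrooms
    falls below that of the randomly assigned classrooms."""
    sd_grouped = average_class_sd(grouped)
    sd_random = average_class_sd(random_assigned)
    return 100.0 * (sd_random - sd_grouped) / sd_random


# Hypothetical grade placement scores, one inner list per classroom.
district_a = [[6.8, 7.1, 7.4, 7.0], [5.0, 5.3, 4.8, 5.1], [3.9, 4.2, 3.6, 4.1]]
district_r = [[3.8, 5.2, 7.1, 4.9], [4.1, 6.9, 5.0, 3.7], [7.3, 4.0, 5.1, 5.6]]

print(round(percent_reduction(district_a, district_r), 1))
```

A figure such as the 26 percent reported above would correspond to a value of 26.0 from a calculation of this kind, averaged over the subtests. 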

A similar overall reduction in variability was 
found at the junior high school level. At this level, 
however, the reduction in variability was usually 
much greater for superior and slow groups than for 
average groups. This is because a larger propor- 
tion of the pupils were placed in average groups at 
the junior high school level, and therefore, these 
average groups were nearly as variable as random 
groups in District R. 


Curriculum Adjustments 


We have discussed the procedures used by Dis- 
tricts A and R to group pupils for instruction and 
the differences in classroom homogeneity resulting 
under these systems. Grouping systems, however, 
are but a means to an end; this end being to provide 
a situation in which more adequate curriculum ad- 
justments for individual differences are possible. 
Therefore, in order to evaluate the results obtained 
by ability grouping or enrichment programs, it is 
necessary that the reader have information concern- 
ing the adjustments made in these programs to 
adapt the curriculum to pupils at different levels of 
ability. 

Many of the ability grouping studies carried out 
in the 1920’s and 30’s, such as Worlton’s (123) re- 
search, went to great lengths to maintain identical 
curricula for pupils at all ability levels in ability 
grouped classes. In other words, every effort was 
made to avoid any adjustments in the curriculum that 
might have better adapted it to the superior or slow 
pupil. Such an approach, of course, overlooks the 
main purpose of ability grouping. Thus, in striving 
for better control, these studies eliminated the prin- 
cipal element they should have studied. Some of the 
studies reported in the literature have made con- 
siderable adjustments in the ability grouped classes 
but none in the random grouped classrooms used 
for comparison. This also appears to be a question- 
able approach because, in effect, it involves compar- 
ing ‘‘something’’ with ‘‘nothing’’, i.e., a definite 
program versus no program to adjust the curricu- 
lum to pupil ability. 

In the research reported herein, both of the dis- 
tricts involved had positive programs and were 
closely comparable in the resources available for 
implementing these programs. Both the accelera- 
tion employed in District A and the enrichment pro- 
gram employed in District R attempted curriculum 
adjustment and, for that matter, the two districts 
sometimes employed identical materials for making 
adjustments for different ability levels. These ad- 
justments in District R, however, were made in 
classes in which no attempt had been made to reduce 
the heterogeneity of the group. 


In general, the goal of the District A program at 
both elementary and secondary level was to adapt 
curricular materials to the different ability levels 
and concurrently adjust the rate of presentation to 
the level of the child. Thus, at a given grade level 
the slow group would use curricular materials ap- 
propriate to their ability and would be working on 
learning objectives that had probably been attained 
by the average groups earlier in the year and by the 
superior groups during the preceding year. Adjust- 
ments in rate rather than depth to provide for indi- 
vidual differences is characteristic of acceleration 
programs as opposed to enrichment programs. The 
curriculum that progresses at different rates for 
different pupils requires a careful integration from 
grade to grade and between the elementary and sec- 
ondary programs. Otherwise the bright child will 
find himself repeating in seventh grade material he 
learned in the fifth grade and the slow child will find 
himself expected to have knowledge and skills that 
he has never been taught. Perhaps the main advan- 
tage of the accelerated program is that it permits 
the superior student to move to more advanced 
course work than is possible in an enrichment pro- 
gram. For example, the pupil who finishes the se- 
quence of general science courses a year earlier 
than usual can take one or two more of the advanced 
electives in the physical sciences or natural sci- 
ences than he could otherwise fit into his high school 
course of study. 

In contrast to District A where the main goal was 
to adjust rate of learning to individual differences, 
District R employed enrichment in heterogeneous 
classrooms to adjust the depth of learning to indi- 
vidual differences. The superior pupil in District 
R would therefore not take specific courses, such 
as algebra, any earlier in his program than would 
other pupils. If successful, however, such an en- 
richment program should develop a fuller under- 
standing of the work covered by the superior pupil. 
The enrichment program also provides the bright 
pupil with more time for independent thought and 
study than would be available to him in an acceler- 
ated program. Whether such pupils generally prof- 
it from this free time or instead become lazy and 
develop poor study habits and attitudes is one of the 
questions with which the Utah study was concerned. 

Let us now discuss some of the specific adjust- 
ments made by the two districts in order to provide 
the reader with greater insight into the setting in 
which this research was conducted and the success 
of the two systems in carrying out adjustments dic- 
tated by their respective policies of enrichment and 
acceleration. 


District A Elementary Classes: No regular in- 
service training program was carried out to prepare 
District A teachers in the elementary grades to han- 
dle classes of either superior or low ability pupils. 
However, supervisors worked with the teachers and 
held study groups and conferences during the course 
of the study to help teachers improve their adapta- 
tion of the curriculum to these groups. 

Adjustments in the reading program included 
adopting different textbooks which were appropriate 
in terms of reading level for the pupils in the supe- 
rior, average, and slow groups at each grade level. 
When a child was encountered who required materi- 
als outside of the range provided to classrooms at 
his grade and ability level, additional materials 
were obtained from the supervisor on an individual 
basis. 

The arithmetic program was also adapted to the 
three ability levels. This adaptation involved the 
use of simpler materials and a slower rate of prog- 
ress at the slow ability level and accelerated prog- 
ress through more complex materials at the supe- 
rior level. In the area of arithmetic relatively lit- 
tle supplemental material was made available to the 
classroom teacher as the primary goal of the pro- 
gram was completion of the regular sequence in 
arithmetic at either a faster or slower rate for pu- 
pils of different abilities. 

In the elementary science program, slow pupils 
used a textbook that was designed for the grade lev- 
el previous to the one in which the pupils were en- 
rolled. Regular and superior pupils used the regu- 
lar textbook designed for their grade level. Supple- 
mental materials for the slow classes included the 
basic science education series published by Rowe- 
Peterson for the primary grades. Superior pupils 
had available the same series for the junior high- 
school grades, while regular pupils used the inter- 
mediate grade level series. Science was taught in 
the District A elementary schools using predomi- 
nantly the whole class instructional approach, al- 
though pupils also carried out individual projects. 

The adaptation of the social studies curriculum 
for superior pupils was made primarily through the 
use of supplemental materials. Relatively little ad- 
justment was made in the social studies curriculum 
for slow pupils. 


District R Elementary Classes: Classroom li- 
braries were provided in all District R elementary 
classrooms. These varied in size but were gener- 
ally quite extensive, and contained enrichment ma- 
terials aimed at adapting the curriculum to individ- 
ual differences. Most of the teacher guides used by 
elementary school teachers in District R suggested 
materials and methods for adjusting the curriculum 
for different ability levels within the classroom. 
Perhaps the most extensive adjustments in the 
District R elementary classes were in the teaching 
of reading. Reading materials over a wide range of 
levels were available in each classroom. Intra- 
class grouping was used for reading instruction. 
Fewer materials for adjusting the District R cur- 
riculum to the different ability levels were available 
in arithmetic than in the other subject areas con- 
sidered in this research. Arithmetic teaching was 
carried out on a whole class basis and the time avail- 
able to the teacher for working with individual chil- 
dren was limited. 

The science curriculum in District R called for 
whole class instruction coupled with individual pro- 
jects. Extensive enrichment materials were avail- 
able in each classroom for adapting the curriculum 
to both slow and superior pupils although more ma- 
terials were available for superior pupils than for 
slow pupils. 

The social studies curriculum at each grade lev- 
el in District R employed a single textbook for all 
pupils with a moderate amount of supplemental ma- 
terials in the classroom libraries. 


Curriculum Adjustments in the Secondary Schools: 
At the junior high level, District A adjustments for 
the different ability groups in mathematics were 
achieved primarily through adjusting the rate at 
which pupils covered the mathematics curriculum. 
Superior pupils covered about four years of work 
during the three year junior high school period, 
average pupils covered the usual three year se- 
quence and slow pupils covered what would be two 
years of work for the average group. 

In making adjustments to the science curriculum, 
District A used both enrichment materials and ad- 
justments in the rate at which these materials were 
presented. Again, the superior pupils covered work 
equivalent to that usually covered in four years by 
average pupils and also worked to a limited degree 
with advanced enrichment materials designed to 
build greater depth. Average pupils in science fol- 
lowed the usual junior high school science sequence 
while slow pupils covered the equivalent of two years 
of the average science program during their three 
years and worked extensively with supplemental ma- 
terials designed for slow learning pupils. 

In District R, a wide selection of enrichment ma- 
terials was available in junior high school mathe- 
matics classes. In science, a very large selection 
of enrichment materials was available; these mate- 
rials had been purchased for the most part 
through an NDEA grant. Science teachers had car- 
ried out surveys to identify enrichment materials to 
be purchased and were therefore familiar with the 
materials available. More enrichment materials 
were available for superior pupils than for slow pu- 
pils. At the junior high school level many of the 
materials used in District R for adapting the cur- 
riculum in heterogeneous classes through enrich- 
ment were also used in the ability grouped classes 
of District A. Both districts used the Rowe-Peter- 
son Enrichment Series, for example. 

At the high school level, both districts required 
only one year each of mathematics and science. The 
electives taken by pupils thus became difficult to 
control and differences between the ability grouped 
and random grouped treatments became increasing- 
ly smaller. Ability plays a large role in all second- 
ary schools in determining which students will take 
the more advanced electives in science and mathe- 
matics. 

Thus, the treatment differences in the Utah study 
were substantial at the elementary level and it is felt 
the findings can be accepted with considerable con- 
fidence. The treatment differences at the junior 
high school level, although subject to some limita- 
tions in the ninth grade, were still large enough so 
that comparisons between the two treatments are 
felt to be reasonably valid and well founded and 
should have many implications for teachers and ad- 
ministrators. At the high school level, however, 
the findings should be regarded at best as tentative. 
Results for the tenth grade are probably meaningful 
indications of treatment differences because fewer 
electives were available during the tenth grade and 
furthermore, tenth grade results can be regarded 
to a considerable degree as reflecting the effects of 
earlier work at the junior high school level. At the 
eleventh and twelfth grades, however, a wide range 
of electives is available and there is a tendency for 
ability grouping to occur naturally in both districts 
because of the restricted ability range of pupils 
choosing mathematics and science electives. These 
circumstances lead the principal investigator to sug- 
gest that treatment differences are limited at this 
level and the reader should interpret the findings 
with caution. 


CHAPTER 2 


ACHIEVEMENT-RELATED VARIABLES 


THE GOAL of this section is to explore the rela- 
tive effectiveness of ability grouping and random 
grouping in terms of achievement-related perform- 
ance of pupils under these two treatments. We will 
examine the rather extensive previous research that 
has been carried out in this area, and will then add 
the findings of this project. In the Utah study, 
achievement data were collected for all subjects at 
the beginning of the study, and on at least three ad- 
ditional occasions during the subsequent four years. 
Achievement data were collected on at least one of 
the five samples at every grade level from fourth 
through twelfth, and for some grade levels, data 
were collected on as many as four different samples. 
In addition, studies of overachievers and under- 
achievers and study methods were carried out. 


PREVIOUS RESEARCH ON ACHIEVEMENT AND 
ABILITY GROUPING 


General Achievement 


Over the years, perhaps the major appeal of abil- 
ity grouping to the educator has been its promise to 
provide a classroom situation in which better adjust- 
ments of the curriculum for individual differences 
might be possible. The goal of such curricular ad- 
justments generally is to bring about higher achieve- 
ment by providing more adequately for individual dif- 
ferences. In view of this appeal, it is not surpris- 
ing to discover upon reviewing the research litera- 
ture related to ability grouping that the vast major- 
ity of studies in this area are concerned with 
achievement gains in ability grouped and random 
grouped classrooms. 

A number of reviews of research in this field 
have been published. The reader is referred 
to Ekstrom (34) who has done a thorough job of re- 
porting and critically evaluating the research on 
ability grouping up to the date of her review. Rather 
than duplicate this and other excellent reviews that 
have been published, we will review in detail only a 
few outstanding studies, many of which have been 
completed since Ekstrom’s review was published. 
This rather detailed coverage has been supplement- 
ed by two tables that give the essential information 
on a large number of additional studies that may in- 
terest the reader. Table 2 covers studies that have 
been carried out at the elementary school level, 
while Table 3 deals with studies at the secondary 
school level. Even these tables are somewhat se- 
lective as they do not include many of the weaker 
experimental studies nor any of those publications 
that present opinions or personal experience rather 
than research evidence. 

In our discussion, we will point out a number of 
weaknesses and limitations. These criticisms are 
not made with the intention of minimizing the impor- 
tance of the knowledge that these studies have given 
us, but instead attempt to provide the reader who is 
unfamiliar with this research area with some in- 
sights that will help him place the various studies in 
their proper perspective. 


Elementary School Level: Although completed 30 
years ago, Hartill’s research (58) is probably the 
best designed work published to date on the relative 
merits of homogeneous and heterogeneous grouping 
at the elementary school level. His subjects includ- 
ed 1374 pupils enrolled in grades 5A, 5B, and 6A. 
On the basis of educational age, one-half of the sub- 
jects were grouped homogeneously into three ability 
levels while the other half were assigned to hetero- 
geneous groups. Pupils in the homogeneous groups 
received a differentiated course of study aimed at 
their ability level, while those in the heterogeneous 
groups received an undifferentiated course of study. 
After the subjects had been exposed to one-half year 
in the homogeneous or heterogeneous classes, they 
were again tested for achievement. 

The following fall, when these pupils returned to 
school, those who had been in homogeneous groups 
during the previous school term were placed in het- 
erogeneous groups, and vice versa. Teachers were 
also advanced so that in a given school the teachers 
who had taught 5A children who were grouped ho- 
mogeneously would teach the same children in grade 
5B when they were placed in heterogeneous groups. 
At the end of the fall term, all pupils were again 
tested for achievement. An overall analysis of the 
achievement results for pupils when in homogeneous 
and heterogeneous groups showed no significant dif- 
ferences between the gains made by these pupils un- 
der the two conditions. When pupils were subdi- 
vided in terms of ability level, however, it was found 
that the superior pupils made larger gains in 
achievement areas tested (except reading) when un- 
der the heterogeneous grouping system. Average 
pupils made larger gains when under the homogene- 
ous grouping system, and slow pupils did equally 
well under the two systems. It should be noted that 
superior pupils in the homogeneous groups were 
given an enrichment program rather than an accel- 
eration program, and were, in fact, held back to a 
normal degree of growth on the fundamentals so as 
to permit more time for enrichment. Hartill con- 
cluded that the homogeneous plan was superior be- 
cause bright pupils did nearly as well on the funda- 
mentals and in addition had time to study the enrich- 
ment materials, average pupils gained significant- 
ly more, and slow pupils did just as well under this 
system. 


TABLE 2 


ABILITY GROUPING STUDIES CONCERNED WITH ACHIEVEMENT AT THE ELEMENTARY SCHOOL LEVEL 


Jones, McCall (1926)a 
Grade levels: 3-7. N: 67 pairs. Length of study: 2 years. Grouping levels: Superior only. Grouping criteria: IQ. 
School subjects measured: Reading, Arithmetic, Spelling. 
Results: Slight advantage for AG sup. groups, not sig. 
Remarks: AG pupils elected to enter accelerated classes, thus biasing sample. Matched pairs used. Curriculum differentiated for AG pupils. 

Worlton (1926) 
Grade levels: 4-7. N: 3700, 1/2 AG and RG. Grouping levels: 3. Grouping criteria: IQ. 
School subjects measured: Reading, Arithmetic, Spelling. 
Results: AG classes made greater gains in composite achieve. score (no mention of stat. sig.). 
Remarks: Steps taken to avoid differentiation of curriculum for different ability levels. 

Baird (1927) 
Grade levels: 4-8. N: 325 in AG + 10 classes RG. Grouping levels: 3. Grouping criteria: Ach. test. 
School subjects measured: Gen. ach. 
Results: 8 of 10 control classes achieved more (no mention of stat. sig.). 

Walton (1928) 
Grade levels: 4-7. N: 1466. Grouping levels: IQ levels. Grouping criteria: Teacher judgment and IQ. 
School subjects measured: Reading, Spelling, Arithmetic. 
Results: Slight advantage for AG sup. groups on composite achieve. score (no mention of stat. sig.). 
Remarks: 1466 pupils above 90 IQ were tested. Pupils were classified into 6 categories each with a range of 10 IQ points. Curriculum differentiation implied but type of adjustments not described. 

Bonar (1929) 
Grade levels: 1. N: 96. Length of study: 1 year. Grouping levels: 2. Grouping criteria: IQ and teacher judgment. 
School subjects measured: Reading. 
Results: Results on objective achieve. test favored RG. No data on stat. sig. 
Remarks: The grouping system used did not result in really different ability groups. The median IQ of the high group was 104, the low group 98 and the het. group 99. 

Dvorak, Rae (1929) 
Grade levels: 1. N: 20 pairs. Length of study: 1 year. Grouping levels: 2. Grouping criteria: IQ. 
School subjects measured: Reading, Spelling. 
Results: No clear-cut dif. bet. AG and RG. Two stat. sig. dif. in reading scores favored AG. 
Remarks: The grouping system used could not be expected to reduce the variability of the class by more than a few percent. 

Gray, Hollingsworth (1931) 
Grade levels: Age 1-9. N: 56 exp., 36 control. Length of study: 3 years. Grouping levels: Superior only. Grouping criteria: IQ (S.B.) above 130. 
School subjects measured: Reading, Arithmetic, Nature, History, Language. 
Results: No stat. sig. dif. found in regular school subjects. ‘‘Enrichment’’ subjects studied by AG pupils not measured. 
Remarks: AG pupils did extra work in some subjects and studied some topics not covered by RG pupils. 

Barthelmess, Boyer (1932) 
Grade levels: 4-5. N: 297 pairs and 268 pairs. Length of study: 1 year. Grouping levels: 3. Grouping criteria: IQ. 
School subjects measured: English, Reading, Geography, Arithmetic. 
Results: Total AG sample was sig. higher on total achieve. for both replications. AG pupils at all 3 ability levels made greater gains but sig. level of these gains not reported. 
Remarks: Study involved 2 replications. Used matched pairs which resulted in loss of up to 53% of AG pupils and 77% of RG pupils. 

Hartill (1936) 
Grade levels: 5-6. N: 1374. Length of study: 1 year. Grouping levels: 3. Grouping criteria: Ach. test. 
School subjects measured: Gen. ach. 
Results: Overall comparisons bet. AG and RG pupils showed no stat. sig. dif. Sup. RG pupils and av. AG pupils made greater gains. No dif. between slow groups. 
Remarks: Well controlled study using counterbalanced design, to control group dif. Differentiated course of study used with AG sup. pupils being given enrichment program rather than acceleration. 

Rankin, et al. (1936) 
Grade levels: 3-6. N: About 500 in each plan. Length of study: 2 years. Grouping levels: 3. Grouping criteria: IQ. 
School subjects measured: Reading, Arithmetic. 
Results: On composite score of 4 reading tests, vertical plan best at all ability levels. Detroit plan higher than RG for composite and bright pupils but these 2 plans about same for av. and slow pupils. In arithmetic Detroit plan best, vertical plan next and RG lowest for all ability levels. Dif. greatest for sup., smallest for slow groups. 
Remarks: Two dif. AG plans compared with RG plan. Vertical plan cut across grade levels to form AG; Detroit plan grouped within grade levels. 

Kvaraceus, Wiles (1938) 
Grade levels: 3. N: 75. Length of study: 1 year. Grouping levels: 3. Grouping criteria: Ach. test. 
School subjects measured: Reading, English, Arithmetic. 
Results: ‘‘Better than av. pupil growth’’, no mention of stat. sig. 
Remarks: Pupils grouped separately in reading, English, and Arithmetic and grouped het. 

Russell (1946) 
Grade levels: 4-6. N: 278 exp., 248 control. Length of study: 2 years. Grouping levels: 3. Grouping criteria: Reading ach. 
School subjects measured: Reading. 
Results: No stat. sig. dif. in gains for either the entire groups or for pupils at dif. ability levels. 
Remarks: Involves AG in reading only with pupils in het. groups for rest of school day. A well designed study. 

Edmiston, et al. (1949) 
Grade levels: 0-6. N: 462. Length of study: 1 year. Grouping levels: See Remarks. Grouping criteria: IQ range. 
School subjects measured: Reading. 
Results: Wide range group made sig. greater reading gains. 
Remarks: Compared ‘‘wide range’’ groups (IQ range 41) and ‘‘narrow range’’ groups (IQ range 29). Curriculum not differentiated. 

Husen, Svensson (1960) 
Grade levels: 5-6. N: 2755. Length of study: 2 years. Grouping levels: 2. Grouping criteria: School marks based on standard test. 
School subjects measured: Reading, Grammar, Spelling, Arithmetic. 
Results: As analysis was broken down by treatment, social class, and ability level, a brief summary of results is not possible. Few sig. dif. found bet. treatments for pupils of comparable ability. 
Remarks: This study was carried out in Sweden. 

Provust (1960) 
Grade levels: 4-6. N: 254. Length of study: 1 year. Grouping levels: 3. Grouping criteria: IQ. 
School subjects measured: Arithmetic. 
Results: Overall comparison gave sig. greater gain for AG. Analysis by level showed AG sup. made greater gains. No dif. bet. AG and RG av. or slow pupils. 
Remarks: Experimental group in AG class for arithmetic only; in RG class for rest of day. Study used matched pairs. 

West, Sievers (1960) 
Grade levels: 5-6. N: 16 exp., 16 control. Grouping levels: Superior only. Grouping criteria: Teacher judgment and ability test. 
School subjects measured: Language Art and Arithmetic. 
Results: Mean gains of 2 to 5 months for AG over RG pupils. 
Remarks: Matched pairs design. Use of teacher judgment probably introduced bias favoring experimental group. Experimental group was in special AG class half day and RG class remainder of day. 

Daniels (1961) 
Grade levels: Junior School 1-4 yrs. N: About 500 cases. Varies from year to year. Length of study: 4 years. Grouping levels: 3. Grouping criteria: IQ and previous grades. 
School subjects measured: English, Reading, Arithmetic. 
Results: 16 dif. bet. means for one pair of schools, 4 for another pair of schools. All 20 reported sig. favoring RG schools. No breakdown of data by ability level. 
Remarks: No data given on curriculum differentiation in AG schools. Groups matched for mean and SD of IQ. Author apparently used one-tailed test of sig. as CR’s as low as 1.75 are reported sig. at .05 level. 

Goldberg, et al. (1961) 
Grade levels: 4. N: 2219. Length of study: 1 year. Grouping levels: See Remarks. Grouping criteria: IQ. 
School subjects measured: Reading, Arithmetic, Language Arts, Work Study. 
Results: Of 105 comparisons, there were 11 that were stat. sig. 10 of these favored ‘‘broad range’’ patterns, i.e., het. grouping. 
Remarks: 5 ability levels combined into classes to provide 15 patterns differing in ability levels and degree of hom. Curriculum not differentiated. 

Koontz (1961) 
Grade levels: 4. N: 191 to 231 for dif. subjects. Length of study: 1 year. Grouping levels: 5. Grouping criteria: Arithmetic, Reading, Language. 
School subjects measured: Arithmetic, Reading, Language. 
Results: RG sample achieved sig. more in reading and arithmetic. No sig. dif. in language. No sig. interaction bet. treatment and level. 
Remarks: AG and RG pupils matched on initial achieve. Analysis of variance used to analyze results. Hawthorne effect probably favored AG. 

Simpson, Martinson (1961) 
Grade levels: 5-6. N: 295 in grades 5-6. Length of study: 1 year. Grouping levels: Superior only. Grouping criteria: IQ (SB) above 130. 
School subjects measured: Arithmetic, Reading, Writing, Listening, Social Studies. 
Results: Both the special enrichment group and special hom. group achieved sig. more than control group on gen. ach. 
Remarks: This study involves a wide variety of treatments for sup. pupils in grades 1-12. All treatments are compared with control groups but treatments are not compared with each other. 

a Taken from Ekstrom (1959). Primary source not available. 

Abbreviations: 
AG = ability grouped 
RG = random grouped 
stat. = statistical, statistically 
sig. = significance, significant 
sup. = superior 
av. = average 
dif. = difference(s), different 
hom. = homogeneous 
het. = heterogeneous 
ach. = achievement (also achieve.) 
bet. = between 
gen. = general 

Hartill’s research is the only study found in the 
area of ability grouping in which a counterbalanced 
research design has been used. This design has the 
great advantage of cancelling out differences between 
experimental and control groups because each pupil 
is measured in both treatments during the two 
phases of the study. Hartill’s design also controlled 
teacher differences by having the same teachers 
move ahead one-half grade so that within a given 
school, the same teachers would teach both homo- 
geneous and heterogeneous classes. 
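
The advantage of the counterbalanced design can be illustrated with a short Python sketch. Because every pupil contributes a gain score under each treatment, the comparison can be made on within-pupil differences, so that stable differences between pupils drop out. The gain scores and the function name below are hypothetical, and the paired t statistic is an assumption chosen for illustration; the source does not state which analysis Hartill actually used. 

```python
from math import sqrt
from statistics import mean, stdev


def paired_comparison(gains_homogeneous, gains_heterogeneous):
    """Compare two treatments using each pupil as his own control.

    Returns the mean within-pupil difference, the paired t statistic,
    and the degrees of freedom (number of pupils minus one).
    """
    diffs = [hom - het for hom, het in zip(gains_homogeneous, gains_heterogeneous)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)
    return mean(diffs), mean(diffs) / standard_error, n - 1


# Hypothetical achievement gains (in months) for the same four pupils
# during their homogeneous and heterogeneous half-years.
hom_gains = [5, 6, 7, 9]
het_gains = [4, 4, 6, 6]

mean_diff, t_value, df = paired_comparison(hom_gains, het_gains)
print(mean_diff, round(t_value, 2), df)
```

In a full counterbalanced analysis the order in which the two treatments were received would also be taken into account; the sketch omits this for brevity. 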

Another of the early studies that involves large 
samples and gets at basic questions with a reason- 
ably sound research design was carried out by Ran- 
kin, et al. (93). This study involved about 500 pu- 
pils in each of three grouping plans, and compared 
achievement gains on four reading tests and three 
arithmetic tests over a two-year period. The three 
grouping plans used were: 1) the ‘‘vertical plan’’ in 
which pupils were grouped on three ability levels 
combining as many as three half-grades to make up 
a class, 2) the ‘‘Detroit plan’’ which also employed 
three ability levels, but did not cut across grade 
lines, and 3) the ‘‘mass instruction plan’’ in which 
no attempt was made to establish homogeneous 
groups. Curriculum adjustments were made in both 
of the ability grouping plans although these adjust- 
ments were greater in arithmetic under the Detroit 
plan than under the vertical plan. In the mass in- 
struction plan ‘‘...a deliberate attempt was made to 
teach all pupils in the same way. This was done by 
using the same material and the same methods for 
all and attempting to hold all pupils to the same 
standards. Standard Detroit materials were re- 
placed in these schools by other materials better 
adapted to mass instruction.’’ (p. 279). 

The results of this research are given in terms 
of gain scores over a two-year period. Gains are 
given for all pupils on each of the tests and are also 
presented for bright, average, and slow groups sep- 
arately. Although the research report presented no 
data on statistical significance, the information pro- 
vided is sufficient for calculation of t-tests between 
some of the final means. These t-tests were cal- 
culated for the Stanford Reading Test, which was 
weighted about half of the composite reading score. 
The results on the Stanford Reading Test comparing 
all pupils regardless of ability levels showed the 
vertical plan to be superior to both the Detroit Plan 
and mass instruction plan with the differences sig- 


nificant beyond the .Q1 level. Differences between 
the Detroit and mass instruction plan were not sta- 
tistically significant. When comparing bright pupils 
only on the Stanford Reading Test, the results again 
showed that the vertical plan resultedin greater 
gains than the other two plans, with these differ- 
ences also significant at the .01 level. The pattern 
was somewhat different, however, for the slow 
group. Slow pupil gains under the vertical planwere 
Significantly higher at the . 05 level than under the 
Detroit plan, but were not significantly different 
from the mass instruction plan. Again, the Detroit 
and mass instruction plans were not significantly dif- 
ferent from each other. It is interesting to note that 
there were no significant differences between heter- 
ogeneous grouping (mass instruction) and homogene- 
ous grouping within grade level (Detroit plan). Those 
Significant differences found favored the more ex- 
treme system of homogeneous grouping across grade 
levels that was employed inthe vertical plan. 
T-tests were also calculated by the reviewer on the 


Detroit Arithmetic Test which was weighted about 


one-half of the composite arithmetic score. None of 
the differences between the three grouping treat- 
ments on arithmetic gains were statistically signifi- 
cant. 
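
The reviewer's procedure of computing t-tests from published summary figures can be sketched as follows. This is a minimal illustration only, assuming that a mean gain, a standard deviation, and a sample size are available for each plan and that a pooled-variance t for independent groups is appropriate. The function name `pooled_t` and every number below are hypothetical and are not taken from the study.

```python
import math

def pooled_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Independent-groups t statistic from summary statistics (pooled variance)."""
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / df
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / std_err, df

# Hypothetical mean gains, standard deviations, and Ns for two grouping plans.
t_value, df = pooled_t(10.0, 4.0, 100, 8.0, 4.0, 100)
print(round(t_value, 2), df)
```

The resulting t would then be referred to a t table at the computed degrees of freedom to judge significance at the .05 or .01 level.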

Probably the greatest limitation in the findings of 
this study was brought about by the removal of all 
enrichment materials, even those normally used by 
the district, from the heterogeneous classrooms in- 
volved in the ‘‘mass instruction plan’’. In taking 
this step, the investigators created an unusual dif- 
ference between the treatments in the amount of sup- 
plemental materials available, and as a result, the 
achievement differences obtained can be considered 
only partially attributable to the grouping treatments. 
The fact that reading differences were much larger 
than arithmetic differences suggests that the absence 
of material for differentiating the curriculum might 
have had greater influence on the results than the 
grouping treatment, because the teaching of reading 
without any materials available for adapting to in- 
dividual differences presents the teacher with a far 
more difficult task than teaching arithmetic under 
these conditions. In spite of this limitation, how- 
ever, this is one of the best of the comparative stud- 
ies because it involves reasonably large samples, 
uses reliable objective measures, breaks down the 
effect of the treatments for pupils of different ability 
levels, and avoids the pitfalls of using matched pairs 
of subjects. 

Among the more recent studies, the work of Dan- 
iels (29) is one of the most extensive at the elemen- 
tary level. In this study of four British junior 
schools (grades 2-5 in our system), two employed 
three-level ability grouping based on IQ and two em- 
ployed random grouping. Although little information 
is provided on the extent of curriculum adjustment 
in the ability grouped schools, the report indicated 
that the four schools had comparable curricula and 
the random grouped schools used whole group in- 
struction rather than enrichment or intraclass group- 
ing. Schools A and B employed random grouping 
while schools C and D employed ability grouping or 
‘‘streaming’’ as the English call it. Schools A and 
C were found to be comparable on a number of per- 
tinent variables as were schools B and D. Group 
matching procedures were used to equate the groups 
for mean and standard deviation of IQ. This match- 
ing resulted in losing relatively few cases. Three 
achievement areas, reading, English, and arithmetic, 
were tested at the start of the study and at intervals 
during the study. For schools A and C, the com- 
bined number of cases for different years ranged 
from 262 to 297. For schools B and D the combined 
N ranged from 197 to 227. 

In comparing mean achievement differences in the 
three subject areas between school A and school C, 
Daniels reported all 16 critical ratios between the 
two treatments at various intervals ranging from a 
half year to four years to be statistically significant. 
All of these differences between means were in fav- 
or of school A, the school that employed random 
grouping. Only four comparisons between schools B 
and D are reported in the achievement areas but all 
four of the critical ratios obtained were statistically 
significant favoring school B, the random grouped 
school. All data were analyzed for total groups 
rather than being broken down by ability levels. 

Daniels used the rather unwieldy procedure of re- 
matching his groups at various intervals in order to 
measure gains at different periods over the four 
years. For example, to measure the significance 
of gains during the third and fourth years in English, 
he rematched his groups on the English test scores 
given in the second year and then compared the 
mean English test scores that the groups so matched 
earned during the fourth year. Analysis of covari- 
ance would have been a far simpler and more effec- 
tive tool for this study. In general, however, Dan- 
iels’ study appears well designed and carefully car- 
ried out. If the data had been analyzed separately 
for pupils at different ability levels, this study would 
have provided a great deal more useful information. 

Another recent study by Goldberg, et al., (46), 
is by far the most extensive project to date aimed at 
evaluation of ability grouping per se, that is, without 
any attempt at adjustment of the curriculum for 
groups of different ability levels or degrees of ho- 
mogeneity. This research was carried out in the 
New York Public Schools starting in spring of 1956. 
Subjects were selected from the fourth grade level. 
Five ability levels were identified, ranging from 
Group A with IQs of 130 or higher to Group E having 
IQs of 99 and lower. Classes were organized so as 
to contain different combinations of the various abil- 
ity levels. For example, Pattern 1 contained only 
pupils with IQs of 130 and above (Group A); Pattern 
2 included pupils from both groups A and B; Pattern 
3 included pupils from groups A, B, and C, and so on. 
A total of 86 classes were formed which were class- 
ified into 15 patterns representing different combi- 
nations of pupils at the five IQ levels. A wide range 
of measures were administered to the sample in- 
cluding academic achievement in reading, arithme- 
tic, language arts, work study skills as measured 
by the SRA Achievement Series, and science and 
social studies as measured by the Stanford Achieve- 
ment Tests. Findings of this study in general indi- 
cate that when no attempts are made to adjust the 
curriculum for different ability levels, the process 
of narrowing the ability range in the classroom is 
unlikely to raise pupil achievement. In this re- 
search, as a matter of fact, Pattern 5, which rep- 
resented the broadest ability spread among the 15 
patterns, appeared to be most consistently associa- 
ted with greater academic gains for all pupils. The 
achievement differences between patterns, however, 
tended to be small with no one pattern or combina- 
tion of patterns resulting in better achievement for 
all pupils in all subject areas. 

The findings of this study indicate that achieve- 
ment was influenced more strongly by the teacher 
and group differences in individual classrooms than 
by the ability range of the class. Advocates of het- 
erogeneous grouping have often argued that the 
presence of superior pupils tends to lead to greater 
achievement by pupils of lower ability. This con- 
tention was tested in Goldberg’s study and it was 
found that only in science was the presence of gifted 
pupils consistently related to greater increments for 
the other four ability levels. In other school sub- 
jects measured, the presence of gifted pupils was 
not related to greater achievement gains for non- 
gifted pupils. 

A further analysis of the data was accomplished 
by combining the 15 ability patterns into three broad 
categories: narrow range, medium range, and 
broad range classes. An analysis of variance indi- 
cated that the ability range in the classroom had a 
significant bearing on the achievement in social stud- 
ies, reading comprehension, vocabulary, arithme- 
tic, and total average achievement; while science, 
language arts, and work study skills were unaffected 
by the range. In the six academic areas affected, 
overall comparisons showed greater gains for pupils 
in the broad range classes as compared with the 
medium or narrow range patterns. When each abil- 
ity level was taken separately, the effects of range 
were inconsistent, although revealing some signifi- 
cant differences. Of the 105 possible comparisons 
between pupils in the five ability levels, the three 
range patterns and the seven academic areas, only 
11 reached significance, and in 10 of these the broad 
range was superior to either or both of the other two 
ranges. 
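
The count of 105 comparisons can be checked with a short sketch. It rests on assumptions that the source does not state outright: that the figure arises as five ability levels times three pairwise contrasts among the range categories times seven academic areas, that each test was run at the .05 level, and that the tests can be treated as roughly independent. The variable names are illustrative.

```python
from itertools import combinations

ability_levels = ["A", "B", "C", "D", "E"]
range_patterns = ["narrow", "medium", "broad"]
academic_areas = 7  # the text reports seven academic areas

# Three range categories give three pairwise contrasts.
range_pairs = list(combinations(range_patterns, 2))
total_comparisons = len(ability_levels) * len(range_pairs) * academic_areas

alpha = 0.05  # assumed significance level
expected_by_chance = total_comparisons * alpha

print(total_comparisons, expected_by_chance)
```

Under these assumptions about five of the 105 comparisons would be expected to reach significance by chance alone, so the 11 reported are roughly twice the chance count.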

This study also explored the question of whether 
the pupil’s position in his group in terms of ability 
was related to his achievement. This is an impor- 
tant question because one of the commonly used ar- 
guments against ability grouping is that it deprives 
the slow pupils of the chance to learn from more 
able classmates. The analysis showed that in most 
subjects the presence of one or two levels of more 
able classmates and no slower ones did not have any 
positive effect upon achievement. 

The research carried out by Goldberg and her 
associates is very well designed and involved a 
thorough and sophisticated analysis that deals with 
most major questions concerned with class ability 
level and degree of heterogeneity as it relates to 
achievement. 

When the overall results of the study are consid- 
ered, one must conclude that variations in the abil- 
ity level, the ability range, and the position of in- 
dividual pupils within the ability range in classes 
where no specific curricular adaptations are made 
have little effect upon pupil achievement. In other 
words, ability grouping per se offers no achieve- 
ment advantages over random grouping and appeared 
less effective than broad range grouping under the 
conditions present in Goldberg’s study. This is one 
of the very few areas related to ability grouping in 
which scientific evidence from an extensive and well- 
designed study seems to have presented a clear and 
reliable answer. 


Secondary School Level: Nearly all studies con- 
cerned with ability grouping versus random group- 
ing at the secondary level have deficiencies that 
raise serious questions about the validity of their 
findings. We will limit this review to a few of the 
recent studies that appear to provide reasonably 
valid research evidence. The reader is referred to 
Table 3 for a brief summary of the major studies 
concerned with ability grouping at the secondary 
school level. 

Starting in 1956, an experimental four-track cur- 
riculum was established in the Washington, D.C. 
high schools. The following year this program was 
extended to the eleventh grade and in 1958 to the 
twelfth grade. When the first class progressing 
through the entire sequence had graduated, an evalu- 
ation of the program was carried out and reported 
(57). This four-track program was essentially an 
ability grouping program designed to reduce the 
ability range of each classroom and increase effi- 
ciency of instruction by differentiating the curricu- 
lum. The four tracks included the honors track 
that was restricted to pupils above 115 IQ, the col- 
lege preparatory track, the general track, and the 
basic track. The honors track was subjected to an 
extensive comparative evaluation in which seniors 
from the class of 1958 who were not in the track 
program were compared with seniors from the class 
of 1959 who were in the honors track during their 
high school years. The study involved the top 2 70 
graduates of each year, the groups being selected 
so as to give an even proportion of boys and girls 
and very similar composition in terms of IQ. Both 
groups had been administered the Iowa Tests of Ed- 
ucational Development during their senior year. 
The median score for the honors track students of 
1959 was five points higher than the comparison 
group of 1958 in ‘‘correctness of expression’’, three 
points higher in ‘‘quantitative thinking’’, and six 
points higher in ‘‘science achievement’’. The two 
groups were also compared on the College Entrance 
Board Examination. The honors program group was 
37 points higher on their median scholastic aptitude 
score, and 30 points higher on their median mathe- 
matics score. Although the research report does 
not give levels of statistical significance for these 
differences, it appears likely that all of the differ- 
ences reported are significant. This study presents 
rather sound evidence in favor of a highly differen- 
tiated curriculum for superior students. A thorough 
analysis of the other three tracks as they compare 
with heterogeneous grouping would have added great- 
ly to the value of the study. 

Probably the best controlled and most informa- 
tive study concerned with ability grouping at the sec- 
ondary level that has been reported to date is the re- 
search recently completed by Drews (31). Although 
limited in some respects in that pupils were studied 
at only one grade level, and in one subject area, 
this research used exceptionally good design and as- 
sembled data on a number of important variables 
that have not been explored in previous research. 
The goal of Drews’ research was to determine the 
effects of heterogeneous and homogeneous grouping 
upon pupils at three ability levels in ninth grade 
English. The criteria for establishing the pupil’s 
ability level were IQ, reading skills, and language 
skills. Placement of pupils into the two treatments 
and the three ability levels was carried out using 
stratified random sampling prior to the start of the 
school year. Tests and questionnaire data were col- 
lected at the beginning and end of the year, and data 
were also gathered in observations, three being 
made of each class involved in the study. The final 
sample for whom all major data were collected in- 
cluded a total of 432 students. Because both the 
homogeneous and heterogeneous groups were divid- 
ed into three ability levels, and each ability level 
was divided by sex for the purposes of analysis, 
some of the sub-samples were small. 

Eight teachers were involved in the experiment: 
four experienced and four inexperienced. As a rule, 
each teacher taught two homogeneous classes at dif- 
ferent levels and one heterogeneous class, thus help- 
ing to cancel out possible differences in teaching 
ability. Experienced and inexperienced teachers 
taught equal proportions of the homogeneous and het- 
erogeneous classes. All teachers met with Drews 
for a 90 minute period every two weeks throughout 
most of the experimental year in an effort to de- 
velop a common viewpoint on the methods and cur- 
riculum adjustments to be made in the homogeneous 
and heterogeneous classes. Teachers attempted to 
adapt the curriculum in both homogeneous and het- 
erogeneous classes. Some acceleration was car- 
ried out in the superior homogeneous classes, but 
the main goal of the differentiated curriculum at all 
ability levels was enrichment. One of the outstand- 
ing aspects of Drews’ study was the great effort 
aimed at helping teachers develop a well-differenti- 
ated curriculum for use in both homogeneous and 
heterogeneous classrooms. 

Initial and final achievement were measured in 
Drews’ study using the reading and language tests 
from the California Achievement Test Battery. The 
advanced form was used for superior pupils, the in- 
termediate form for average pupils, and the ele- 
mentary form for slow pupils. In addition to the 
reading and language tests given at all ability levels, 
superior students were also administered tests of 
critical thinking and dogmatism and average pupils 
were administered tests of problem solving and dog- 
matism. The slow group was given an individual 
reading test in addition to the group tests of reading 
and language. Initial and final test scores were an- 
alyzed using the t-test to compare homogeneous and 
heterogeneous sub-samples of the same sex and abil- 
ity levels. In reading comprehension, none of the 
comparable groups were significantly different on 
either the initial or final testing. The same com- 
parisons of the pre- and post-tests of language 
achievement yielded the same results, although 
some of the differences obtained approached statis- 
tical significance at the .05 level. Negative results 
were also obtained on the critical thinking tests, 
problem solving tests, and dogmatism scale. Thus, 
it may be concluded from Drews’ study that homo- 
geneous grouping in ninth grade English has no sig- 
nificant effect upon the achievement areas measured 
at any of the three ability levels. In addition to the 
achievement data given above, Drews’ study also 
dealt with a number of other important variables 
that will be discussed in the non-cognitive section of 
this report. 

Drews’ study, although very well designed, had 
two limitations worthy of mention. One of these, 
brought out by Drews herself, was that the achieve- 
ment measures used were aimed primarily at meas- 
urement of facts, while the type of program that 
Drews was attempting to stimulate might be expect- 
ed to develop many aspects of achievement not 
measured by a fact-oriented test. The study col- 
lected other data that were aimed at measurement 
of these higher goals, but this weakness could have 
been further overcome by using an achievement bat- 
tery such as the Sequential Tests of Educational 
Progress, which places more emphasis upon higher 
cognitive processes than does the California Achieve- 
ment Test. 

Another minor weakness of Drews’ study was her 
use of the t-test for measuring both pre- and post- 
test differences, rather than the use of analysis of 
covariance, which would have adjusted the post-test 
mean scores for pre-test differences that she found. 
Although it is doubtful that the use of analysis of co- 
variance would have made any major change in 
Drews’ results, it would appear from studying her 
data that two or three of the post-test differences 
would have reached statistical significance if this 
analysis technique had been employed. Even con- 
sidering these limitations, Drews’ study is excep- 
tionally well done and probably provides more sound 
scientific evidence on the effects of ability grouping 
at the secondary level than all previous research 
combined. 
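
The adjustment that analysis of covariance applies to post-test means can be sketched as follows. This is a minimal illustration, assuming a single pre-test covariate and a common within-group regression slope. The scores are hypothetical and are not data from Drews’ study, and the function names are invented for the example. A full analysis of covariance would also supply an F test on the adjusted means, which is omitted here.

```python
def mean(values):
    return sum(values) / len(values)

def pooled_within_slope(groups):
    """Pooled within-group regression slope of post-test on pre-test."""
    sxy = sxx = 0.0
    for pre, post in groups:
        pre_mean, post_mean = mean(pre), mean(post)
        sxy += sum((x - pre_mean) * (y - post_mean) for x, y in zip(pre, post))
        sxx += sum((x - pre_mean) ** 2 for x in pre)
    return sxy / sxx

def adjusted_post_means(groups):
    """Post-test means adjusted to the grand pre-test mean."""
    slope = pooled_within_slope(groups)
    grand_pre = mean([x for pre, _ in groups for x in pre])
    return [mean(post) - slope * (mean(pre) - grand_pre) for pre, post in groups]

# Hypothetical (pre-test, post-test) scores for two treatments.
homogeneous = ([10, 12, 14], [20, 23, 26])
heterogeneous = ([12, 14, 16], [22, 25, 28])

adj_homog, adj_heter = adjusted_post_means([homogeneous, heterogeneous])
print(adj_homog, adj_heter)
```

In this invented example the raw post-test means (23 and 25) favor the heterogeneous group by two points, but that group also began two points higher on the pre-test. After adjustment the means are 24.5 and 23.5, and the comparison favors the homogeneous group.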


Critical Evaluation 


A critical evaluation of studies concerned with 
ability and achievement reveals a number of weak- 
nesses. Perhaps the most serious of these is that 
many studies are of only one year duration. In most 
of these studies the ability grouping method is the 
‘‘new’’ or ‘‘experimental’’ treatment. This situation 
tends to lead to a significant Hawthorne Effect which 
may well explain all or most of the differences found 
in these studies. 

Another weakness of many of these studies is the 
use of inadequate analysis techniques. Many of the 
early studies and some of the more recent ones re- 
port no tests of statistical significance but merely 
indicate that one or the other group did ‘‘better’’ or 
‘‘achieved more’’ (85, 123, 63, 12, 96). 

Billett (10), in one of the early reviews of re- 
search in this field indicated that, of the studies re- 
ported between 1910 and 1928, only four made rea- 
sonable attempts to control pertinent variables. 
Other investigators have been content to make a sin- 
gle comparison between ability grouped and random 
grouped samples and have made no attempt to ap- 
praise possible differential effects of ability group- 
ing versus random grouping for pupils at different 
ability levels (29, 121). As there is no reason to 
assume that ability grouping affects superior, aver- 
age, and slow pupils in the same way, these overall 
comparisons can have little real meaning. It is at 
least hypothetically possible, for example, for su- 
perior pupils to make much greater gains in ability 
grouped classes, average pupils to do equally well 
in either grouping situation, and slow pupils to make 
much greater gains in random grouped classes. In 
this case, if pupils of different ability were lumped 
together for analysis, the lower relative gains of the 
slow pupils would cancel out the higher gains of the 
bright pupils and lead the investigator to conclude 
that the two grouping systems brought about no dif- 
ference in achievement. Thus, such gross compar- 
isons can give us little insight into what really hap- 
pened in an experiment. 
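
The cancellation described above can be made concrete with a small numerical sketch. All gain figures are hypothetical, chosen only to mirror the pattern in the text, and equal numbers of pupils at each ability level are assumed.

```python
# Hypothetical mean gains (illustrative only), equal numbers of pupils per level.
gains = {
    "superior": {"ability_grouped": 14.0, "random_grouped": 10.0},
    "average":  {"ability_grouped": 10.0, "random_grouped": 10.0},
    "slow":     {"ability_grouped": 6.0,  "random_grouped": 10.0},
}

def pooled_mean(treatment):
    """Mean gain across ability levels, assuming equal group sizes."""
    return sum(level[treatment] for level in gains.values()) / len(gains)

# Comparison with all ability levels lumped together.
overall_difference = pooled_mean("ability_grouped") - pooled_mean("random_grouped")

# Comparison made separately within each ability level.
by_level = {name: level["ability_grouped"] - level["random_grouped"]
            for name, level in gains.items()}

print(overall_difference)
print(by_level)
```

With these figures the pooled difference is zero, while the separate comparisons show a four-point advantage for superior pupils in ability grouped classes and a four-point advantage for slow pupils in random grouped classes.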

A design weakness in many ability grouping stud- 
ies is the use of matched pairs of subjects (91, 66, 
77). Many of the deficiencies of matching techniques 
such as the tendency for matched groups to be biased 
with respect to variables other than the matching 
variable, and the biases introduced by loss of cases 
because of non-random factors, were generally un- 
known when many of the earlier studies were car- 
ried out. These deficiencies have been generally 
recognized by researchers for the past 15 years; 
nevertheless, a study reported in 1960 (90) 
employed matched pairs, which resulted in the 
loss of about one-half of the cases involved in the 
experiment. 

A number of other studies (66, 59) have used 
techniques for selecting pupils for ability grouped 
classes which bias the sample and seriously distort 
the findings. In a study by Justman (67), for exam- 
ple, pairs of intellectually gifted pupils from ‘‘spe- 
cial progress’’ and ‘‘normal progress’’ classes 
were matched on a number of variables including 
grade level, sex, chronological age, and mental age. 
Pupils for the ‘‘special progress’’ classes which, in 
this case, were homogeneous groups of superior pu- 
pils, however, had been especially selected as 
those ‘‘... who show superior academic achieve- 
ment and who also possess personal characteristics 
of initiative, enthusiasm, willingness to work, reli- 
ability, regular attendance, and capacity for sus- 
tained work.’’ (67). As all of these variables have 
an obvious relationship to achievement, and as the 
pupils assigned to homogeneous ‘‘special progress’’ 
classes were especially selected for possessing 
these characteristics, it is hardly a surprise to 
learn that these pupils made significantly higher 
achievement scores at the end of the study than did 
their equally intelligent but less motivated contem- 
poraries, who remained in the normal progress 
classes. These results become more suspect when 
we learn that no attempt was made to compare the 
groups on achievement at the start of the study, at 
which time the ‘‘special class’’ group was probably 
already superior in the achievement areas meas- 
ured. Thus, although the study was well executed 
except for the aforementioned design errors, it 
seems unwise to place any confidence in Justman’s 
findings in view of these errors. 

Westmayer (118) appears to have introduced 
a similar bias into his research. He compared su- 
perior pupils who elected to take a special advanced 
problem class in science with pupils of similarly 
high ability in heterogeneous science classes. Al- 
though pupils were matched on the basis of grade 
point average for previous high school work, it would 
appear that the factors that lead some pupils to vol- 
unteer for an advanced problems course are related 
to motivation and interest in science and therefore 
would be expected to be related to science achieve- 
ment. The results of this study showed that the 
homogeneous group selecting the advanced problems 
class obtained significantly higher scores at the end 
of the study on a standardized achievement test. It 
is impossible to determine, however, how much of 
this difference was due to ability grouping, and how 
much was due to sampling bias. In evaluating re- 
search projects in this area, it is probably safe to 
assume that when pupils who elect to take a special 
class are compared with equally intelligent pupils 
who have not elected to take the class, or when pu- 
pils are placed in special homogeneous groups be- 
cause they possess special characteristics not pos- 
sessed by pupils in the control group, the sampling 
bias so introduced will account for much of the dif- 
ference found. 

While the aforementioned studies suffer princi- 
pally because of the lack of sufficient experimental 
controls, the findings of some studies found in the 
ability grouping literature suffer from having been 
controlled too much. Ability grouping, as has been 
stated earlier, is essentially aimed at providing a 
classroom situation in which more adequate curric- 
ulum adjustments for individual differences may be 
made. Thus, studies of ability grouping that do not 
make some provisions for differentiating content 
and method for the different levels of ability cannot 
produce evidence on some of the major questions in 
this area. Several of the earlier studies including 
those carried out by Ullrich (115), Burtt, Chassel, 
and Hatch (21), and Worlton (122) attempted to keep 
course content and method identical for all ability 
levels and for both homogeneous and heterogeneous 
classes in their studies. These efforts at control, 
of course, tend to hold constant the very area that 
should be manipulated in the ability grouping experi- 
ment. Such studies are valuable in answering ques- 
tions about ability grouping per se, but tell us noth- 
ing about the effects of ability grouping under the 
conditions under which it is used in the field, that is, with 
curriculum differentiation. Some recent studies 
have attempted to study the merits of ability group- 
ing per se by developing or identifying classes that 
vary in terms of homogeneity and ability level and 
then comparing achievement for these classes. A 
study by French (37) employed this technique of lo- 
cating and comparing classes already in existence 
which happen to differ in degree of homogeneity, 
while the research reported by Edmiston and Ben- 
fer (33) manipulated the degree of homogeneity while 
making no other adjustments. The extensive re- 
search of Goldberg, et al. (45), discussed earlier in 
this chapter, was also aimed at evaluating ability 
grouping per se. 


RESEARCH RELATED TO UNDERACHIEVEMENT 


In addition to studying general achievement in the 
two grouping systems, the Utah study was concerned 
with the number of overachievers and underachiev- 
ers in the two systems. No research has been re- 
ported that related overachievers or underachievers 
to ability grouping, but a number of studies on the 
nature of underachievement have been reported. 
Some of these studies will be briefly summarized to 
give the reader some insight into this topic. 

The studies reviewed have been divided into 
those dealing with environmental factors that might 
relate to underachievement such as family back- 
ground, home conditions, social position of parents, 
community and neighborhood environment, compan- 
ions and age preferences, counseling procedures, 
teacher influence, and social acceptance; and into 
those dealing with personal factors that might re- 
late to underachievement such as aspirations, self- 
concept, emotional disturbance, health, intelli- 
gence quotient, goals and achievement motives, in- 
terests, values, and occupational choices. 

Factors brought out in the literature on under- 
achievement may be summarized as follows: 

Environmental factors relating to underachieve- 
ment that have been mentioned in the literature in- 
clude: 

1. Lack of good models in education and oc- 
cupation in those they admire in home and 
community (44, 101). 

2. Low educational level of family (44). 

3. Unstable family structure (44). 

4. Lower socioeconomic status (61, 40, 79, 90). 

5. Ethnic and religious prejudice (44). 

6. Rejection often by classmates for personal 
or caste reasons (6, 86). 

7. Often over-age in their peer group, caus- 
ing rejection (6). 

8. Environment too complicated for their 
ability (73). 

9. Have often received coercive influence 
from teachers or others (35). 

10. Suffer more pressures (48). 

11. Blocks to learning have been set up (48). 

Personal factors related to low achievement and 
underachievement that have been mentioned in the 
literature include: 

1. Underachievers dislike or fear people, 
especially parental or authoritative fig- 
ures, siblings, and peers (44, 120). 

2. Unable to create warm relationships with 
others (20, 16). 

3. Have lower or unrealistic levels of aspira- 
tion (120, 104). 

4. More often follow vocational goals set for 
them by others (2). 

5. Have fewer vocational and other interests 
(2, 73). 

6. Have stated goals not in line with their 
dominant interests (2, 73). 

7. More prone to develop mental illness; 
more emotionally disturbed; have more 
difficulty in making non-academic adjust- 
ments (64, 109, 114, 16). 

8. Have lower threshold of frustration (35). 

9. Have lower intelligence quotient (64, 60). 

10. Have low self-concept of ability (18, 44, 16). 

11. Prefer to avoid conflicts (2). 

12. Less cooperative, less dependable, have 
poorer judgment (2). 

13. Prefer outdoor activity (2). 

14. Prefer companions older than self (2). 


CHAPTER 3 


GENERAL ACHIEVEMENT 


NOW THAT we have reviewed previous research 
relating achievement to ability grouping let us ex- 
amine that portion of the Utah study that is con- 
cerned with this topic. 


WORKING HYPOTHESES 


The working hypotheses on achievement which 
formed the basis for designing this phase of the pro- 
ject were founded primarily upon psychological ra- 
tionales developed by the author or postulated in the 
writings of other psychologists and educators. 
Previous research was also used as a basis for de- 
veloping working hypotheses, but because of the de- 
sign limitations characteristic of most previous 
work, one finds this work a poor foundation up- 
on which to build hypotheses. The major working 
hypotheses relative to achievement were: 

1. Superior pupils will achieve more in ability 
grouped classrooms in which the curriculum is ad- 
justed primarily through changes in learning rate 
than they will in random grouped classrooms in 
which enrichment is the primary means of curricu- 
lum adjustment. This hypothesis has considerable 
basis in the findings of previous research. A num- 
ber of studies such as the work of Provust (90), 
West and Sievers (117), and Herr (59) for example, 
have found greater achievement gains for superior 
pupils in ability grouped classrooms than for supe- 
rior pupils in heterogeneous classrooms. 

Educators have also made certain assumptions 
about the ability grouping situation which would tend 
to lead one to expect superior pupils to make great- 
er gains under this treatment. It is often maintained, 
for example, that the superior pupil is seldom chal- 
lenged in the random grouped classroom and, as a 
result, tends to work at a level far below his poten- 
tial. In contrast, the ability grouped classroom is 
looked upon as providing a situation in which the su- 
perior pupil is faced with an accelerated program 
and more intense competition from his peers and 
these factors tend to motivate him to higher achieve- 
ment. 

2. With respect to the achievement of average 
pupils in ability grouped and random grouped class- 
rooms, it was our working hypothesis that no signif- 
icant differences would emerge. Both the findings 
of previous research and the rationale underlying 
ability grouping tend to lead to this hypothesis. 
A number of studies including Worlton (123), Pro- 
vust (90) and Billett (9), have obtained greater differ- 
ences in achievement for pupils of high or low abil- 
ity than were found for pupils around the center of 
the ability distribution. A frequent criticism of het- 
erogeneous grouping is that the curriculum is 
geared to the average pupil. If this is the case, the 
average pupils would have the least to gain in an abil- 
ity grouped situation because the random grouped 
situation is already adapted to their level. Some ed- 
ucators have maintained that the average pupil suf- 
fers in the ability grouped situation because he is de- 
prived of the assistance of superior pupils in the 
classroom situation. Some research (74), however, 
suggests that little real contact takes place between 
pupils of different ability levels, thus minimizing 
the possible gains that could occur through interac- 
tion between pupils of different ability. 

3. It was hypothesized that slow pupils would 
make greater achievement gains in the random 
grouped situation. This working hypothesis was 
somewhat harder to establish than those previously 
discussed, because of a number of apparently con- 
flicting pressures and motivations that seem to be 
operating upon the slow pupil in the two grouping sit- 
uations. Some of these pressures would appear to 
raise the achievement of the slow pupil in the ability 
grouped classroom, while others would appear to 
lower it. For example, the fact that the rate of 
learning in ability grouped classrooms can better be 
adjusted to the capacities of the slow learning pupils 
would lead one to expect these pupils to achieve more 
in the ability grouped situation. On the other hand, 
the stigma that most educators believe to be associ- 
ated with membership in a slow group might well de- 
velop unfavorable attitudes, reduce motivation, and 
lower the self-concept and aspiration level of slow 
pupils in ability grouped classes. 

In random grouped classrooms it has been argued 
that the class goals are so far beyond the ability of 
the slow pupil that he becomes discouraged and dis- 
interested. On the other hand, it may be contended 
that the random grouped classroom in effect pro- 
vides an accelerated situation for the slow pupil, the 
pressures of which might lead him to greater achieve- 
ment, just as similar pressures in ability grouped 
classrooms are thought to motivate the bright pupil. 
The reader may have noted in the review of the lit- 
erature that studies dealing with the achievement of 
ability grouped slow pupils report somewhat conflict- 
ing results. Some studies have found no differences 
between the achievement of slow pupils in ability 
grouped and random grouped situations (85, 58, 90). 
Others have shown relatively greater gains for slow 
pupils in ability grouped classes (9, 92). A study by 
Cook (27) found that ability grouped slow pupils 
achieved more in some subjects while random 
grouped slow pupils achieved more in others. 


PROCEDURE 


Sample 


Achievement data were collected for Samples IV, 
VI, VII, VIII, and IX. Achievement gains from year 
to year as well as gains over the entire four years 
were studied for each sample. This procedure pro- 
vided a large body of data on the nature of long term 
achievement in the two grouping treatments. Also, 
because achievement data for two or more samples 
were available at most grade levels, extensive 
cross checking of the results was possible. 


Achievement Measures 


Because of financial limitations imposed during 
the pilot study year, the California Achievement 
Test Battery already in use in the two school dis- 
tricts was employed. This battery, however, was 
not used beyond the pilot study year. A careful 
study of all available achievement tests was made 
in order to select the battery to be used during the 
final three years of this study. The Sequential 
Tests of Educational Progress (STEP) were finally 
selected. These tests have a number of advantages 
which were not possessed to the same degree by any 
of the other achievement batteries examined. Among 
these advantages are the following: 

1. They emphasize broad understanding and the 
ability to utilize learned skills in solving new prob- 
lems rather than ability to handle only specific facts, 
thus reducing the likelihood of content validity differ- 
ences for the two districts participating in the re- 
search. 

2. The battery contains vertically comparable 
tests that cover the entire grade span of subjects in 
this study. As most of the subjects in this research 
were administered two different levels of the STEP 
during the four years of the study, this vertical 
comparability is a highly desirable feature. 

3. Detailed studies of the norm distributions 
carried out by Educational Testing Service indi- 
cated that in every instance, the test provided ade- 
quate ceiling for better students in the schools from 
which the norm samples were drawn and in general 
provided an adequate floor for low ability students. 
This characteristic is particularly important in a 
study in which pupils of a given grade level but of 
different ability levels must be analyzed separately. 


ACHIEVEMENT RESULTS 


Most of the differences in achievement gains will 
be reported between pupils of comparable ability in 
the two districts. The analyses were designed to 
yield information most pertinent to the question: 
“Do comparable pupils in ability grouped and ran- 
dom grouped classrooms make comparable achieve- 
ment gains?” The usual analysis procedure was to 
identify pupils in District A and District R who were 
in a given ability level classification (superior, av- 
erage, or slow) and carry out a separate analysis of 
covariance for each pair of comparable ability 
groups. 
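
The procedure just described can be illustrated with a short computational sketch. It follows the standard textbook one-covariate analysis of covariance; the report does not spell out its own computing formulas, so this should be read as an assumption about the general method rather than a record of the project's calculations. The function name `ancova_one_covariate` and all scores are invented for illustration.

```python
from statistics import fmean


def ancova_one_covariate(groups):
    """One-covariate analysis of covariance (standard textbook form).

    groups maps a label to a list of (initial, final) score pairs.
    Returns the F ratio for the adjusted group difference, its error
    degrees of freedom, and the adjusted final means.
    """
    all_x = [x for pairs in groups.values() for x, _ in pairs]
    all_y = [y for pairs in groups.values() for _, y in pairs]
    n, k = len(all_x), len(groups)
    grand_x, grand_y = fmean(all_x), fmean(all_y)

    # Total sums of squares and cross-products about the grand means.
    sxx_t = sum((x - grand_x) ** 2 for x in all_x)
    syy_t = sum((y - grand_y) ** 2 for y in all_y)
    sxy_t = sum((x - grand_x) * (y - grand_y) for x, y in zip(all_x, all_y))

    # Pooled within-group sums of squares and cross-products.
    sxx_w = syy_w = sxy_w = 0.0
    group_means = {}
    for label, pairs in groups.items():
        mx = fmean([x for x, _ in pairs])
        my = fmean([y for _, y in pairs])
        group_means[label] = (mx, my)
        sxx_w += sum((x - mx) ** 2 for x, _ in pairs)
        syy_w += sum((y - my) ** 2 for _, y in pairs)
        sxy_w += sum((x - mx) * (y - my) for x, y in pairs)

    slope = sxy_w / sxx_w  # pooled within-group regression of final on initial
    adj_total = syy_t - sxy_t ** 2 / sxx_t
    adj_within = syy_w - sxy_w ** 2 / sxx_w
    df_error = n - k - 1
    f_ratio = ((adj_total - adj_within) / (k - 1)) / (adj_within / df_error)

    # Final means adjusted to the grand mean of the initial scores.
    adjusted = {
        label: my - slope * (mx - grand_x)
        for label, (mx, my) in group_means.items()
    }
    return {"F": f_ratio, "df_error": df_error, "adjusted_means": adjusted}


# Invented (initial, final) scores for one ability level in each district.
district_r = [(10, 12.1), (12, 13.9), (14, 16.1), (16, 17.9), (18, 20.1), (20, 21.9)]
district_a = [(11, 16.1), (13, 17.9), (15, 20.1), (17, 21.9), (19, 24.1), (21, 25.9)]

result = ancova_one_covariate({"A": district_a, "R": district_r})
print(result)
```

In the project this kind of comparison would be repeated separately for the superior, average, and slow pairs, so that each F ratio refers only to pupils of comparable ability.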


Sample IV 


The first achievement battery was administered 
in October, 1958, when Sample IV pupils were be- 
ginning the fourth grade. This sample was studied 
from the beginning of the fourth grade to the end of 
the seventh grade and provides the best evidence 
that this project has to offer concerning the effects 
of ability grouping on the achievement of the elemen- 
tary school child. The second battery was adminis- 
tered in April, 1959, and the results of this testing 
were adjusted for initial differences using analysis 
of covariance. The final adjusted overall achieve- 
ment means were significantly higher for superior 
pupils in District A as compared to District R. This 
difference was also found in comparing average pu- 
pils. The difference between slow pupils in the two 
districts, however, was not significant, and in this 
case, the trend favored the random grouped slow 
pupils. 

In addition to comparisons of overall achievement 
based on the total California Achievement Test Bat- 
tery, pupils were compared on the basis of their 
achievement in the three subtests in the battery (see 
Table 4). Superior pupils in District A were signif- 
icantly higher than comparable pupils in District R 
on all three of the subtest scores. Comparisons of 
the average pupils in the two districts showed Dis- 
trict A pupils to be significantly higher in the lang- 
uage and reading subtests. Comparisons between 
slow pupils in the two districts on the three subtests 
of the CAT battery revealed no significant differ- 
ences. 

In comparing the first year’s achievement of Dis- 
trict A and District R, several points should be kept 
in mind. One important factor that probably influ- 
enced the scores in District A is the fact that these 
data were collected during the first year that ability 
grouping was employed in this district. Interviews 
with teachers and supervisors during this year indi- 
cated that teachers had not yet adjusted their techniques 
and materials to the different ability levels as fully 
as they did in later years. Another factor that 
almost surely operated to some degree in District A 
during the first year is the Hawthorne Effect. The abil- 
ity grouping system received considerable publicity 
and the local press favored this innovation. The 
change aroused interest and enthusiasm on the part 
of many teachers and administrators in District A. 
Such interest usually can be expected to bring about 
some favorable change in pupil performance regard- 
less of the merits of the variable introduced. 

Achievement gains for the second year of the 
study were computed by comparing the scores on the 
CAT battery administered near the end of the fourth 
grade with a battery made up of four tests from the 
Sequential Tests of Educational Progress (STEP) 
administered near the end of the fifth grade. The STEP 
and CAT have two subtest areas in common, mathe- 
matics and reading. Gains in these two areas and 
on the overall battery scores were compared for 
this sample. Superior pupils did not differ signifi- 
cantly in mathematics gains or in overall gains 
based upon the composite battery scores. The su- 
perior pupils in District R, however, made signifi- 
cantly greater gains in reading (t = 2.29) than com- 
parable pupils in District A. Comparisons of aver- 
age subsamples revealed that District A pupils made 
significantly greater gains (.01 level) in reading, 
mathematics, and overall achievement. Slow pupils 
in District R registered significantly greater gains 
in reading (t = 3.65) and in the overall battery score 
(t = 4.44). The difference in mathematics gains, 
though not statistically significant, also favored the 
District R sample. These differences along with 
other achievement differences between the two treat- 
ments are summarized briefly in Table 6. 

During the third year of the study, both the initial 
and final achievement means were based on the STEP 
battery. It was possible during this year to analyze 
achievement gains on the overall battery score and 
also on the mathematics, science, reading, and so- 
cial studies subtests. The gains made by pupils of 
similar ability in District A as compared with Dis- 
trict R revealed no significant differences in mathe- 
matics, reading, social studies, or in the overall 
STEP battery score. Only in science did statistical- 
ly significant differences occur. These differences 
were present at all three ability levels and in every 
case favored District A. 

During the fourth year of the study, pupils in Sam- 
ple IV were in the seventh grade. It will be recalled 
that at the seventh grade level, District A employed 
ability grouping only in mathematics and science. 
The STEP battery, form 3B, was administered near 
the end of the seventh grade. Superior pupils in both 
districts were given the high school form of the STEP 
(2B) in order that there would be no chance of their 
being restricted by the test ceiling. The results re- 
vealed only one significant difference in compari- 
sons between the ability grouped and random grouped 
seventh grade pupils of similar ability levels on the 
STEP mathematics and science tests. It was found 
that superior ability grouped pupils made signifi- 
cantly greater gains in mathematics than superior 
random grouped pupils (t = 2.21). The District R 
slow pupils made greater achievement gains in both 
tests than the District A slow pupils, although these 
differences were not large enough to be statistically 
significant. For average pupils the differences be- 
tween the two districts were inconsistent and small. 


Results for the Four Years 


In addition to analyzing gains year by year for 
Sample IV, the scores on the initial CAT battery ad- 
ministered in the fall of 1958 were compared with 
the scores made by the same pupils on the final STEP 
battery administered in spring of 1962 (see Table 4). 
These results indicated that over the four-year pe- 
riod, the ability grouped superior sample had made 
significantly greater overall achievement gains. For 
the average groups, there were no significant differ- 
ences, and for the slow groups, the District R pu- 
pils made significantly greater gains. For this com- 
parison the pupils were classified in accordance 
with their ability level at the outset of the study 
while in the year to year comparisons, classifica- 
tion was based upon the pupil’s ability level at the 
beginning of the year in question. 

The year by year analyses appear to have one 
disadvantage. They fail to reflect adequately the 
cumulative effects of the two grouping treatments 
because pupils shift in their ability level classifica- 
tion from year to year if their performance justifies 
a change. For example, a child who is initially in 
the slow group in District A may be moved to the 
average group for the second year of the study and 
will thus become part of the average group during 
the second year’s analysis. There was, in fact, a 
rather large number of pupils in District A who did 
move out of the slow group. The data for the entire 
four years fill in this gap and provide a more ac- 
curate picture of long term gains under the two 
grouping treatments. 


Sample VI 


This sample was tested at the beginning of the 
sixth grade in October, 1958, and was followed through 
the ninth grade with final achievement measures ad- 
ministered in April, 1962. Sixth grade achievement 
for this sample was measured by the California 
Achievement Test Battery. The same limitations ap- 
ply to these data that were discussed previously with 
regard to Sample IV. It may be seen in Table V that 
the initial mean scores of District A and District R 
pupils at the three ability levels were closely com- 
parable. Analysis of variance of the total battery 
scores yielded F values of .24, .67, and .52 be- 
tween slow, average, and superior pupils in the two 
districts. The final overall achievement means 
were adjusted for these slight initial differences us- 
ing covariance analysis. Adjusted final means 
showed that superior pupils in District A had made 
considerably greater gains (.01 level) on the CAT 
composite battery score than their peers in District 
R. Differences significant beyond the .01 level were 
also found between average pupils in the two treat- 
ments, again favoring District A. No significant 
differences were found between slow pupils. The 
pattern of differences for this sample is very simi- 
lar to that found in the first year performance of 
Sample IV (see Tables 4 and 5). 

Achievement on the three specific subtests of the 
CAT (arithmetic, language, and reading) was also 
compared. These results also followed a pattern 
generally similar to that found in the Sample IV 
analysis. District A superior pupils had signifi- 
cantly higher final adjusted mean scores on all three 
of the CAT subtests. These differences were all 
significant beyond the .01 level, with the arithmetic 
and reading differences amounting to over one-half 
year in terms of grade placement scores and yield- 
ing very large adjusted F’s (90.82 and 103.64, re- 
spectively) on the analysis of variance. It will be 
noted that the greatest difference between the two 
superior groups was in reading, as was also the case 
for Sample IV. A comparison of average pupils in 
the two districts revealed significant differences 
favoring District A pupils in arithmetic and reading. 
These differences were much smaller than those 
found for superior pupils. No significant differ- 
ences were found between the average groups in the 
language subtest. District A slow pupils were sig- 
nificantly higher in their final adjusted language 
means, but there were no significant differences be- 
tween slow pupils in the two districts on the arith- 
metic or reading subtests. 

In order to determine achievement gains during 
the seventh grade, the CAT scores earned by Sam- 
ple VI pupils at the end of the sixth grade were used 
to adjust the means on the STEP mathematics and 
science tests that were administered at the end of 
the seventh grade. 

The results showed the District R superior and 
slow pupils to be significantly higher than comparable 
District A pupils in mathematics at the end of grade 
seven (Fy.x = 4.01 for superior, 11.69 for slow). 
This difference, though not large for the superior 
sample, is noteworthy as it cancels out the rather 
large difference favoring the District A pupils that 
was found during the first year of the study. The 
STEP mathematics scores earned by the two sam- 
ples at the end of grade seven were less than one 
point apart, indicating that, when one considers the 
first two years for this sample, achievement in 
mathematics was nearly identical for superior pu- 
pils in the two districts. In contrast, the significant 
final mean differences between the slow pupils leave 
the District R sample somewhat ahead in mathemat- 
ics achievement for the initial two years. 


Because the CAT does not include a science score, 
STEP science scores were adjusted for initial differ- 
ences using the CAT composite battery score. No 
significant differences in the adjusted final STEP 
science mean scores were found between any of the 
comparable ability groups in the two districts. 


No achievement measures were administered to 
the secondary school sample during the third year 
of the study, so the achievement during the final two 
years of the study is covered in one analysis. Gains 
made between the end of grade seven and the end of 
grade nine on the STEP achievement battery were 
compared using analysis of covariance. No signifi- 
cant differences were found between comparable abil- 
ity groups in mathematics. A small difference fav- 
oring the District R average sample was found in 
analysis of the science gains (t = 2.26). Gains at 
the superior and slow ability levels were not signif- 
icantly different for the two grouping systems. 


Results for the Four Years 


As three of the four years covered in this analy- 
sis involved secondary school achievement, analy- 
sis of gains in mathematics and science seemed 
more appropriate than an analysis of overall gains 
such as was used in the four-year analysis of Sam- 
ple IV achievement. In mathematics, the superior 
District A sample showed a small but significantly 
greater gain over the four-year period (Fy.x = 5.82). 
This difference takes on greater importance when 
we recall that District A superior pupils in this sam- 
ple were significantly lower in socioeconomic status 
and thus probably less motivated to achieve than the 
superior District R pupils. The District A average 
pupils registered a similar gain in science over 
their District R peers (Fy.x = 4.46). All other dif- 
ferences failed to reach statistical significance. The 
Sample VI data suggest that achievement gains that 
can be attributed to the ability grouping system over 
four years are generally small. 


Sample VII 


This sample was tested at the beginning of the 
seventh grade in October, 1958, and was followed 
through the tenth grade with final achievement meas- 
ures administered in April, 1962. Along with Sam- 
ple VI, these pupils represent the best data provid- 
ed in this research concerning the effects of ability 
grouping at the junior high school level. During 
three of the four years of this study, pupils in these 
two samples were at the junior high school level. 

The first year analysis for this sample was lim- 
ited to mathematics as no science measure is in- 
cluded in the CAT battery. Comparisons of supe- 
rior pupils based on analysis of covariance yielded 
no significant differences. District R average pu- 
pils showed significantly higher final means on the 
CAT arithmetic tests than comparable District A 
pupils (Fy.x = 9.50). 

Achievement during the eighth grade for Sample 
VII was determined by adjusting end-of-year STEP 
achievement means in mathematics and science for 
initial differences found in the California Achieve- 
ment Test Battery administered at the end of the 
seventh grade. Comparisons between final adjusted 
mean scores in mathematics showed a significant dif- 
ference in favor of the District A superior pupils 
(Fy.x = 6.18). Comparisons between average and 
slow pupils in mathematics achievement yielded no 
significant differences. The final adjusted mean 
scores in science also showed statistically signifi- 
cant differences in favor of the District A superior 
pupils (Fy.x = 6.39). District A average pupils 
were also significantly higher (Fy.x = 6.04). No sig- 
nificant differences were found between slow pupils 
in the two districts. 

Achievement of the Sample VII pupils between the 
end of the eighth grade and the end of grade ten 
probably provides the most reliable comparative 
data for this sample. These data include adequate 
numbers of cases, extend over a two-year period, 
and compare gains on two different forms of the 
STEP mathematics and science tests. Comparisons 
in mathematics and science yielded only one signifi- 
cant difference between pupils of comparable ability 
in the two districts. In science achievement the 
District R slow pupils were significantly higher 
than comparable District A pupils (Fy.x = 7.21). 

The overall gains in mathematics and science 
achievement for the four years of the study were al- 
so computed for Sample VII. No significant achieve- 
ment differences were found between District A and 
District R at any of the three ability levels in either 
science or mathematics. The only cumulative four 
year difference that approached statistical signifi- 
cance was between the final adjusted science means 
of superior pupils, and favored the District A sam- 
ple. 


Sample VIII 


This sample was tested at the beginning of the 
eighth grade in October, 1958, and was followed 
through the eleventh grade with final achievement 
measures administered in April, 1962. As was the 
case with Sample VII, the first year’s achievement 
analysis for Sample VIII was limited to mathematics 
achievement. End-of-year achievement based on 
the CAT arithmetic test was adjusted for initial 
achievement differences on a different form of the 
same test. 

Comparisons of the final adjusted means for the 
first year indicated a significant difference in favor 
of District R average pupils (Fy.x = 4.62). Differ- 
ences between District A and District R superior 
and slow pupils were not statistically significant. 
During the second year of the study, Sample VIII pu- 
pils were in the ninth grade. District A superior 
and slow pupils obtained significantly higher adjust- 
ed final mean scores in mathematics than compara- 
ble District R pupils (Fy.x = 4.35 and 5.52). No sig- 
nificant differences were found between average pu- 
pils in the two districts in mathematics achievement. 
In comparing science achievement, only one signifi- 
cant difference was found, with District A average 
pupils making greater gains (Fy.x = 10.54). 

The achievement made by Sample VIII pupils be- 
tween the end of the ninth grade and the end of the 
eleventh grade in mathematics and science was 
measured by two forms of the STEP achievement 
test in these areas. Because of the wide range of 
electives available during the high-school years, 
these data can be considered only tentative evidence 
on the relative merits of the two grouping treat- 
ments. No significant differences were found be- 
tween the final adjusted means of District A and 
District R pupils in either mathematics or science 
at any of the three ability levels. 


Results for the Four Years 


A comparison of achievement of Sample VIII pu- 
pils over the entire four years of the study was 
made using analysis of covariance. None of the dif- 
ferences in final adjusted means in mathematics or 
science between comparable District A and District 
R pupils were statistically significant. When it is 
remembered that these comparisons represented the 
cumulative effects of four years under the ability 
grouping and random grouping treatments, the lack 
of any differences reaching the .05 level certainly 
suggests that the two systems had little or no differ- 
ential effect upon the long term achievement gains 
of these pupils. 


Sample IX 


This sample was tested at the beginning of the 
ninth grade in October, 1958, and was followed 
through the end of the twelfth grade when final 
achievement measures were administered. This 
group was somewhat smaller than samples at the 
other grade levels, but represents the best evidence 
available in this research concerning the relative 
achievement gains of ability grouped and random 
grouped pupils in mathematics and science at the 
high school level. During three of the four years of 
the study, these pupils were high school students. 
Limitations imposed by the range of mathematics 
and science electives at the high school level should 
be considered when interpreting Sample IX achieve- 
ment results. 

Achievement data during the ninth grade were lim- 
ited to mathematics achievement on the California 
Achievement Test. No significant differences were 
found between the ability grouped and random 
grouped pupils during this year. Analysis for the 
second year of the study was based on the CAT bat- 
tery administered at the end of the ninth grade and 
the STEP mathematics and science tests adminis- 
tered at the end of the tenth grade. 

No significant differences were found between 
District A and District R pupils of comparable abil- 
ity in either mathematics or science achievement. 
Achievement during the period between the end of 
the tenth and the end of the twelfth grade was meas- 
ured by two forms of the STEP mathematics and sci- 
ence tests. In mathematics, the District A superi- 
or group had a significantly higher final adjusted 
mean score (Fy.x = 5.51). Ability grouped pupils 
at the average level also had a significantly higher 
adjusted final mean score in mathematics (Fy.x = 
6.57). No significant difference was found between 
slow pupils in the two grouping systems. Compari- 
sons of science achievement for District A and Dis- 
trict R pupils of similar ability revealed no signifi- 
cant differences over this two year period. 


Results for the Four Years 


Analysis of the achievement gains for the four 
years of the study revealed significant differences 
in final adjusted mathematics achievement means, 
favoring District A average and District A slow pu- 
pils as compared with District R pupils of similar 
ability (Fy.x = 6.14 and 77. 8). No significant dif- 
ferences were found in mathematics between supe- 
rior pupils in the two districts. Comparisons be- 
tween treatments on final adjusted science achieve- 
ment yielded no significant differences. 


SUMMARY AND CONCLUSIONS 


In the preceding pages we have examined in some 
detail the achievement data collected on the five 
samples of pupils studied in this research during 
the four years of the project. Now let us attempt to 
pull these data together in order to appraise the ef- 
fect of ability grouping and random grouping upon 
achievement in the three major school divisions - 
elementary, junior high, and senior high school. 


Elementary School Achievement 


Elementary school achievement data in this re- 
search are based on the performance of Sample IV 
in grades four, five, and six; and Sample VI in 
grade six. Achievement in four subject areas as 
well as overall achievement were compared during 
the elementary school years for these samples. The 
reader will note in Table 6 that during the elemen- 
tary school years, a total of 54 statistical compari- 
sons were made between District A and District R 
pupils. Of these 54 comparisons, 28 were statisti- 
cally significant. Nineteen of these significant dif- 
ferences were found favorable to District A pupils 
while nine favored District R pupils. At first glance 
this would seem to indicate a rather marked over- 
all advantage for ability grouping at the elementary 
level. On looking further, however, we find that of 
the 19 significant differences favoring District A 
pupils, 15 occurred during the first year of the 
study. It is interesting to compare the sixth grade 
results obtained for Sample IV and for Sample VI 
and speculate about the reason for the much larger 
differences found among Sample VI pupils. There 
are several possible reasons for these differences, 
but the most likely appears to be that the Hawthorne 
Effect was operating rather strongly in favor of the 
District A program during the first year. If these 
first year overall differences had been due primari- 
ly to a true superiority of ability grouping over ran- 
dom grouping we would expect the differences to 
grow larger each year as the cumulative effects of 
the better system widened the achievement gap. It 
will be seen in Table 6 that no such cumulative ef- 
fects occurred and in fact, most of the achievement 
differences favoring District A disappeared by the 
time Sample IV pupils had completed the sixth grade. 

Although the data obtained in this study show no 
great overall superiority for either ability grouping 
or random grouping, some interesting differences 
in the two grouping treatments emerge when we con- 
sider the data for each ability level separately. 
When we compare superior pupils in the two dis- 
tricts, we find that for the 18 achievement compari- 
sons, 11 are statistically significant, with ten of 
these favoring the District A pupils. In terms of 
overall achievement differences for the four years 
of the study, District A superior pupils were signifi- 
cantly higher. 

Comparisons between average pupils during the 
elementary school years showed no consistent trend 
favoring either grouping system. Of the 18 com- 
parisons made, seven were significant favoring Dis- 
trict A, four favoring District R and the remainder 
were not significant. Thus, there appears to be lit- 
tle to choose between random grouping and ability 
grouping so far as the achievement of average pupils 
is concerned. 

Comparisons between slow pupils in the two 
grouping treatments showed six significant differ- 
ences, four of these favoring District R and two fav- 
oring District A. District R slow pupils made sig- 
nificantly greater overall achievement gains during 
the four years of the study than their peers in Dis- 
trict A. If, as seems likely, Hawthorne Effect were 
operating strongly in favor of District A during the 
first year, the relative performance of District R 
slow pupils was probably even better than indicated 
in Table 6. 

All in all, we may conclude that neither ability 
grouping with acceleration nor random grouping 
with enrichment is superior for all ability levels of 
elementary school pupils. In general, the relative 
achievement advantages of the two grouping systems 
were slight, but tended to favor ability grouping for 
superior pupils and random grouping for slow pupils. 
As was hypothesized, the differences for average 
pupils did not consistently favor either grouping 
treatment. 
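
The counts quoted in this section can be checked for internal consistency with a short tally. This is only a bookkeeping sketch using the figures stated above; the total of 18 comparisons for slow pupils and the single superior difference favoring District R are not stated directly and are inferred here by subtraction.

```python
# Elementary school comparisons as reported in the text.
# Inferred (not stated): 18 slow-pupil comparisons, 1 superior difference favoring R.
elementary = {
    "superior": {"comparisons": 18, "favor_A": 10, "favor_R": 1},
    "average": {"comparisons": 18, "favor_A": 7, "favor_R": 4},
    "slow": {"comparisons": 18, "favor_A": 2, "favor_R": 4},
}


def tally(table):
    """Sum comparisons and significant differences across ability levels."""
    total = sum(row["comparisons"] for row in table.values())
    favor_a = sum(row["favor_A"] for row in table.values())
    favor_r = sum(row["favor_R"] for row in table.values())
    return {
        "comparisons": total,
        "favor_A": favor_a,
        "favor_R": favor_r,
        "significant": favor_a + favor_r,
        "nonsignificant": total - favor_a - favor_r,
    }


totals = tally(elementary)
print(totals)
```

The totals agree with the figures given for the elementary years: 54 comparisons, of which 28 were significant, 19 favoring District A and nine favoring District R.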


Junior High School Achievement 


All five of the samples employed in this study 
were in junior high school at some time during the 
four years of the project. When we combine the 
achievement data for these five samples, we find 
that a total of 60 statistical comparisons were made 
between comparable groups in the two districts; 33 
in mathematics and 27 in science. Of the mathemat- 
ics comparisons, five were significant in favor of 
District A subgroups, five in favor of District R, 
and the remaining 23 were nonsignificant. Of the 
science comparisons, five were significant favoring 
District A, one favoring District R, and the remain- 
ing 21 were nonsignificant. Thus, in terms of the 
overall effects of the two grouping treatments, the 
districts appear comparable in mathematics achieve- 
ment, but there is some tendency for District A pu- 
pils to make greater achievement in science. 

Let us now study the results of the junior high da- 
ta at each of the three ability levels. A total of 11 
mathematics comparisons were made between superior District A 
and R students during the junior high school years. 
Of these, four were significant in favor of District 
A, while one favored District R. Of the nine com- 
parisons between science achievement of superior 
pupils, only one difference was statistically signifi- 
cant, favoring District A. 

For average pupils, it will be noted in Table 6 
that two significant differences favored District R in 
mathematics, and the remaining nine comparisons 
are nonsignificant. The science achievement of av- 
erage pupils shows somewhat better achievement for 
District A pupils with four of the nine comparisons 
being statistically significant, all favoring District 
A. 

In comparing the mathematics achievement of
slow pupils, we find one significant difference favoring
District A, and two favoring District R. In science,
only one of the nine comparisons between slow
groups was statistically significant, this favoring
District R. From the aforementioned data we may
conclude that at the junior high school level, there
is some tendency for ability grouping to lead to greater
mathematics achievement among superior pupils
and greater science achievement among average pupils.
Among slow pupils, the differences between
the two grouping treatments are small, but tend to
favor District R in both mathematics and science.
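
The junior high school counts reported above can be cross-checked with a short tally. The following is a minimal sketch in Python; the dictionary layout and the function name are illustrative assumptions, and the figure of 11 mathematics comparisons at each ability level is inferred from the stated total of 33 rather than given in the text for every level.

```python
# Significant junior high comparisons as reported in the text.
# Tuple layout (an assumption made for illustration):
# (comparisons made, significant favoring District A, significant favoring District R)
JUNIOR_HIGH = {
    "mathematics": {
        "superior": (11, 4, 1),
        "average": (11, 0, 2),
        "slow": (11, 1, 2),   # 11 is inferred from the total of 33
    },
    "science": {
        "superior": (9, 1, 0),
        "average": (9, 4, 0),
        "slow": (9, 0, 1),
    },
}


def tally(subject):
    """Return (total, favoring A, favoring R, nonsignificant) for one subject."""
    rows = list(JUNIOR_HIGH[subject].values())
    total = sum(n for n, _, _ in rows)
    favor_a = sum(a for _, a, _ in rows)
    favor_r = sum(r for _, _, r in rows)
    return total, favor_a, favor_r, total - favor_a - favor_r


for subject in JUNIOR_HIGH:
    print(subject, tally(subject))
```

The per-level counts add up to the overall figures given in the first paragraph of this section, which is a useful consistency check on the summary statements.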


Senior High School Achievement 


Somewhat fewer data are available in the Utah
study on senior high school achievement. These data
are based on the tenth and eleventh grade achievement
of Sample VIII, and the tenth, eleventh and
twelfth grade achievement of Sample IX. A total of
thirty comparisons were made between the mathematics
and science achievement of District A and R
pupils in these samples. Of these thirty comparisons,
four were statistically significant favoring
District A, and the remaining 26 were nonsignificant.
All four of the significant differences were in
mathematics achievement, with one difference between
superior pupils, two between average pupils and one
between slow pupils in the two districts.

These data would lead to the tentative conclusion
that ability grouping is associated with slightly greater
mathematics achievement at all ability levels during
the high school years. It may also be concluded
that the grouping treatments have no effect upon science
achievement during these years. In reviewing
the high school achievement data, the reader should
recall that these data can be accepted with less confidence
than either the elementary or junior high
school data for the reasons previously discussed.


OVERACHIEVERS AND UNDERACHIEVERS


ALTHOUGH a number of studies have been conducted
in which achievement under different grouping
systems has been compared, achievement of individual
pupils in these systems as related to the
achievement one might expect on the basis of the
individual’s ability has not been explored. In this
phase of the Utah study, samples of elementary
school pupils from random grouped and ability
grouped classrooms were classified as overachievers,
underachievers and normal achievers based on
the relationship between their achievement, chronological
age, and mental age. The proportions of
overachievers, normal achievers and underachievers
among pupils of different ability levels in the
two systems were then studied in an effort to obtain
insight into the question of who is motivated to
achieve under these two methods of grouping.


WORKING HYPOTHESES 


The following working hypotheses were estab- 
lished with respect to the performance of overachiev- 
ers, underachievers, and normal achievers in abil- 
ity grouped and random grouped classrooms: 

1. There will be no significant difference in the 
overall proportion of overachievers, underachievers, 
and normal achievers in the ability grouped sample 
as compared with the random grouped sample. 

2. There will be no significant differences in the
proportions of overachievers, normal achievers and
underachievers between the fourth and sixth grade
samples.

3. There will be significantly more boys than 
girls who are classified as underachievers and few- 
er boys who are classified as overachievers. 

4. There will be a significant tendency for su- 
perior pupils in the ability grouping system to be 
more frequently overachievers and less frequently 
underachievers than in the random grouping system. 
This hypothesis is based on the observation that 
schools employing random grouping often slant the 
level of work to the average pupil and in so doing 
may fail to challenge the superior pupil. 

5. There will be no significant difference between
the ability grouping system and the random
grouping system in the proportion of overachievers,
normal achievers, and underachievers among average
pupils. Because of the tendency to aim the curriculum
at the average pupil in the heterogeneous
class, the two grouping treatments involved in the
Utah study would be most similar for pupils at this
ability level. One difference between the treatments
that may affect the average pupil is his association
with superior pupils in the heterogeneous class.
Some educators have maintained that the average
pupil is motivated to greater achievement because
of the presence of superior pupils in the heterogeneous
classroom. If this claim is valid, more overachievers
and fewer underachievers should be found
in the District R average sample.

6. There will be significantly fewer underachievers
and more overachievers among slow pupils in
the random grouping system than in the ability grouping
system. This hypothesis is based on the same
rationale as hypothesis four. It is anticipated that
random grouping, by slanting instruction to the average
pupil, will tend to put pressure on the slow
pupil to achieve.


PROCEDURE 


Sample 


The data collected in the Utah study that deal with
achievement potential are based upon measures obtained
on pupils in Samples IV and VI during the
first year of the study. A total of 674 District A pupils
and 793 District R pupils were compared. These
pupils were classified in terms of achievement category
(overachievers, normal achievers, and underachievers),
sex, and ability level (superior, average,
and slow).


Measures 


There is no general agreement among educators
on a precise definition of “overachiever” and
“underachiever”. Most definitions, however, provide
for comparisons between the individual’s achievement
and his achievement potential as estimated by
an intelligence test. Underachievers are defined as
achieving at some point below their potential achievement.
Overachievers are less frequently discussed.
Most educators have difficulty accepting the concept
of the “overachiever” as this implies that the
individual is achieving more than his potential. In this
study we have defined the underachiever as a person
who is achieving more than nine months below the
normal level expected on the basis of mental age and
chronological age, and the overachiever as a person
achieving the same amount above this expected level.
The expected level, however, should not be thought
of as the individual’s maximum potential. The concept
of the overachiever becomes more acceptable
if we consider the following analogy. Let us say that
the normal achiever is similar to the average automobile
that has eight cylinders but generally operates
on only five or six of them. The underachiever
then becomes comparable to the auto that lurches
unsteadily along on three or four of its eight cylinders,
while the overachiever may be compared with
that rare vehicle that speeds along on seven or perhaps
even all eight cylinders. Thus, the overachiever
performs better than the average achiever of
comparable ability, and more closely approaches
his achievement potential.

In order to identify underachievers, the mean
achievement score of fourth and sixth grade pupils
on form W of the California Achievement Test Battery
administered at the end of the school year was
calculated for both districts. The overall mean
grade placement score on the CAT for the District
A sixth grade group was 7.58 as compared with the
overall mean grade placement for the District R
group of 7.48. Comparable means for the fourth
grade were 5.67 and 5.68. The classification of pupils
as underachievers and overachievers presents
some problems. A number of methods have been
used from time to time, including the rating of any
pupils who are not achieving up to their normal
grade placement as underachievers. Such a system
ignores the pupil’s achievement relative to his ability.
Another system defines the underachiever as a
pupil whose educational age is lower than his mental
age. This system seems to place excessive weight
on ability and tends to classify the majority of bright
pupils as underachievers. Although the bright pupil
may be expected to score well above his grade placement
in subjects such as reading that are not closely
tied to the learning of specific sequences of operations,
it is not reasonable to expect the bright pupil
to score greatly above his grade placement in subjects
such as mathematics where a knowledge of certain
operations is essential for working the more
advanced problems.

In the Utah study a system of classification for
underachievers developed by the Santa Monica, California,
Unified School District was employed. The
formula used in this case determines the expected
grade placement by taking the sum of twice the pupil’s
mental age plus his chronological age and dividing
by three. The mental age used in this study was
based upon the Language Score of the California
Test of Mental Maturity (CMM). This formula is, of
course, arbitrary, but seems to be reasonable as it
takes into account both mental age and chronological
age in arriving at an expected educational age score.
Each pupil in the sample was given a score that represented
the difference between his expected educational
age in months, based on the Santa Monica formula,
and his obtained educational age in months,
based on the California Achievement Test Battery,
form W. This battery was found to yield educational
age scores consistently higher than expected. The
median educational age on the CAT was four months
higher than the median expected educational age calculated
by the Santa Monica formula. The median
educational age was used as the center of the normal
achieving group. Pupils whose educational age was
nine months or more above this median were classified
as “overachievers”, while those whose educational
age was nine months or more below this median
were classified as “underachievers”.
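
The classification procedure just described can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the districts' own scoring routine: the function names are hypothetical, all ages are taken to be in months, the four-month median difference is treated as a constant shift of the expected score, and the cut-off follows the "nine months or more" wording of the procedure rather than the "more than nine months" wording used earlier in this section.

```python
def expected_educational_age(mental_age, chronological_age):
    """Santa Monica formula as described in the text: (2 * MA + CA) / 3, in months."""
    return (2 * mental_age + chronological_age) / 3


def classify(obtained_ea, mental_age, chronological_age,
             median_shift=4, band=9):
    """Classify one pupil relative to the center of the normal achieving group.

    median_shift: months by which obtained scores ran above expected scores
                  (four months in the text; applying it as a constant is an assumption).
    band:         months above or below the center that mark over/underachievement.
    """
    expected = expected_educational_age(mental_age, chronological_age)
    deviation = obtained_ea - (expected + median_shift)
    if deviation >= band:
        return "overachiever"
    if deviation <= -band:
        return "underachiever"
    return "normal achiever"


# Hypothetical pupil: mental age 132 months, chronological age 120 months.
print(expected_educational_age(132, 120))   # (264 + 120) / 3
print(classify(141, 132, 120))
print(classify(130, 132, 120))
print(classify(123, 132, 120))
```

Weighting mental age twice as heavily as chronological age pulls the expected score toward ability while still anchoring it to the pupil's age, which is the compromise the text describes between the two simpler systems it rejects.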


RESULTS 


The significance of differences in the proportions
of overachievers, normal achievers, and underachievers
in the two grouping treatments was determined
using chi square contingency tables. Overall
comparisons between pupils in the two grouping
treatments were made as well as comparisons between
the sexes, between pupils in the fourth versus
sixth grades, and between District A and District R
pupils at a given ability level.


Sex Differences 


The first comparisons that were made were
aimed at determining whether sex differences occurred
in the overall proportion of overachievers
and underachievers in the two treatments. Chi
square contingency tables were set up comparing
boys with girls in the proportion of underachievers,
normal achievers, and overachievers. In all ten of
the comparisons made, there was a greater proportion
of overachievers and lesser proportion of underachievers
among girls. Seven of these ten differences
were statistically significant (see Table 7).
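
A two-group by three-category contingency table has two degrees of freedom, so a reported chi square can be converted to a significance level by comparison with the standard critical values for that case. The sketch below, in Python, assumes two degrees of freedom for every comparison (the text does not state the degrees of freedom) and uses a hypothetical function name; the chi square values are the District A entries of Table 7.

```python
# Standard chi-square critical values for 2 degrees of freedom.
CRITICAL_DF2 = [(13.816, ".001"), (9.210, ".01"), (5.991, ".05")]


def significance_level(chi_square):
    """Return the most stringent conventional level reached, or 'NS'."""
    for critical, label in CRITICAL_DF2:
        if chi_square >= critical:
            return label
    return "NS"


# District A rows of Table 7 (boys vs. girls).
district_a = {
    "superior": 7.58,
    "average": 3.87,
    "slow": 3.56,
    "overall, 4th grade": 8.45,
    "overall, 6th grade": 16.07,
}

for comparison, value in district_a.items():
    print(comparison, value, significance_level(value))
```

Under that assumption the levels obtained for these five rows agree with those listed in the table.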


Grade Differences 


The next step in the analysis involved determining
whether differences existed between the proportion
of overachievers, normal achievers and underachievers
in the fourth grade as contrasted with the
sixth grade samples within each grouping treatment.
An examination of the chi square contingency tables
revealed that in every case there was a trend toward
fewer overachievers and a greater proportion
of underachievers from the fourth to the sixth grades.
In District A, this tendency was significant at the
.01 level for both boys and girls. In District R, the
same trend existed although it was somewhat more
marked in the case of the girls. This trend toward
a drop in the proportion of pupils who were achieving
up to or beyond expectation and an increase in
the number of pupils who were achieving below expectation
may have been the result of a change in the
pupils’ value system that occurred between these
two grade levels. At the fourth-grade level, most
pupils are still oriented strongly toward the teacher
and are likely to value school achievement more
highly because of the teacher’s influence. By sixth
grade, however, the pupil’s orientation has shifted
to a considerable extent to the peer group in which
school achievement may be valued very little or may
even be ridiculed as a sign of submission to adult
authority.


Between-Treatment Differences 


Although the sex differences and grade differences
we have discussed provide useful insights into
the nature of underachievement, the main purpose
of the study was to compare the proportion of overachievers,
normal achievers and underachievers
found in the two grouping treatments.

A noteworthy finding in comparing underachievers
in the two systems of education is the markedly
greater percentage of underachievers in the random
grouped system. Of the 793 pupils in District R, 175
or 22 percent were classified as underachievers. In
contrast, only 105 of the 674 District A cases or
15.6 percent were classified as underachievers.
This difference is highly significant. As the courses
of study and overall achievement levels within the
two districts were very similar, the comparatively
large number of underachievers in the random
grouped sample may be an important factor in evaluating
the relative merits of random versus ability
grouping. Comparisons of the number of overachievers
show even greater differences. The ability
grouped sample had 29.1 percent overachievers as
compared with 18.9 percent in the random grouped
sample. This difference is highly significant, beyond
the .001 level.
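
The overall comparison can be approximated from the figures given in this paragraph. The sketch below, in Python, computes an uncorrected Pearson chi square for a contingency table. The underachiever counts are those reported; the overachiever counts are an assumption, reconstructed by rounding the reported percentages, so the three-category result is only indicative and is not the value the authors obtained.

```python
def chi_square(table):
    """Pearson chi square (no continuity correction) for an r x c table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


N_A, N_R = 674, 793          # pupils compared in Districts A and R
UNDER_A, UNDER_R = 105, 175  # underachievers, as reported

# Underachievers vs. all other pupils (1 degree of freedom).
under_table = [[UNDER_A, N_A - UNDER_A],
               [UNDER_R, N_R - UNDER_R]]

# Overachiever counts reconstructed from 29.1 and 18.9 percent (assumption).
over_a = round(0.291 * N_A)
over_r = round(0.189 * N_R)

# Under / normal / over (2 degrees of freedom).
full_table = [[UNDER_A, N_A - UNDER_A - over_a, over_a],
              [UNDER_R, N_R - UNDER_R - over_r, over_r]]

print(round(100 * UNDER_A / N_A, 1), round(100 * UNDER_R / N_R, 1))
print(chi_square(under_table))
print(chi_square(full_table))
```

With these inputs the underachiever comparison exceeds the .01 critical value for one degree of freedom (6.635), and the three-category comparison exceeds the .001 critical value for two degrees of freedom (13.816), which is in line with the significance statements in the text.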

The proportions of overachievers and underachievers
among ability grouped and random grouped
subsamples of comparable ability also showed some
interesting differences (see Table 8). Examination
of comparable subsamples indicated that the percentages
of boys and girls in the districts were generally
similar. Therefore, for this phase of the analysis,
the sexes have been combined within each subsample.
Fourth grade and sixth grade groups have
been analyzed separately so that the results for one
grade could be checked against the results for the
other grade, and so that trends could be studied. To
compare the proportions of overachievers and underachievers
in comparable ability subgroups of the two
districts, chi square contingency tables were set up
in which superior, average and slow students in the
ability grouped sample were compared with the same
subgroups in the random grouped sample.

The comparison between superior pupils showed
that the ability grouped sample had a greater proportion
of overachievers and a smaller proportion of
underachievers at both fourth and sixth grades. In
the fourth grade sample, 47.2 percent of the ability
grouped pupils were classified as overachievers as
compared with 31.6 percent of random grouped
pupils. Greater differences in the same direction
were found in comparing the superior sixth grade
pupils in the ability grouped and random grouped
samples. Among ability grouped superior pupils,
41.3 percent were classified as overachievers as contrasted
with 13 percent of the superior pupils in random
grouped classes. Differences between the two
systems in the proportions of sixth grade superior
students classified as underachievers were smaller,
with 16.7 percent of the District A group and 20 percent
of the District R group so classified.

The proportions of overachievers, normal achievers,
and underachievers among average pupils in the
two districts were not significantly different for
either the fourth or sixth grade samples. A larger
proportion of underachievers was found among the
average pupils in random grouped classrooms but
this difference was not large enough to establish a
significant overall trend.

A comparison of slow fourth grade pupils in ability
grouped and random grouped classes also showed
no significant difference in the proportion of overachievers,
normal achievers, and underachievers.
These results were supported by data on slow sixth-grade
pupils in the two samples which also failed to
reveal significant differences.


SUMMARY AND CONCLUSIONS


On the basis of these findings, the following con- 
clusions can be drawn: 

1. There was a significant tendency for girls 
more frequently to be overachievers and less fre- 
quently to be underachievers than boys. This trend 
was present at both grade levels and in both treat- 
ments. 

2. A comparison of fourth and sixth grade pupils
in each of the grouping treatments indicated a
significant trend for both boys and girls to achieve
less at the sixth grade level relative to their potential.
In all cases, the sixth grade samples had a
greater number of underachievers and a smaller
number of overachievers than comparable fourth
grade samples. This drop was considerably greater
for girls in District R than for girls in District
A, which might suggest that ability grouping tends to
lessen the trend toward lower achievement between
the fourth and sixth grades. This implication is not
borne out by the results of the boys, however, whose
drop in relative achievement was substantially the
same in both the random grouped and the ability
grouped samples.

3. Overall comparisons between Districts A and
R indicated that ability grouped pupils were more
often classified as overachievers and less often
classified as underachievers than were random
grouped pupils.

4. In comparing pupils at the three ability levels
in the two districts, the ability grouped samples
were generally found to have a more favorable distribution
of overachievers and underachievers at
both grade levels. The differences between superior
pupils in the two districts were statistically
significant at both grade levels, with District A having
more overachievers than District R and a smaller
percentage of underachievers. These results
seem to support the often stated contention that ability
grouping presents a greater challenge to superior
students.


TABLE 7


DIFFERENCES BETWEEN BOYS AND GIRLS IN THE DISTRIBUTION OF OVERACHIEVERS,
NORMAL ACHIEVERS AND UNDERACHIEVERS


                                           Favored    Chi       Significance
                                           Group*     Square    Level
District A
  Superior Boys vs. Superior Girls           G         7.58      .05
  Average Boys vs. Average Girls             G         3.87      NS
  Slow Boys vs. Slow Girls                   G         3.56      NS
  Boys vs. Girls (overall) (4th grade)       G         8.45      .05
  Boys vs. Girls (overall) (6th grade)       G        16.07      .001
District R
  Superior Boys vs. Superior Girls           G         8.63      .05
  Average Boys vs. Average Girls             G        22.23      .01
  Slow Boys vs. Slow Girls                   G         3.00      NS
  Boys vs. Girls (overall) (4th grade)       G        19.04      .001
  Boys vs. Girls (overall) (6th grade)       G        17.36      .001


* “G” indicates girls had a more favorable distribution of overachievers, normal achievers
and underachievers.


TABLE 8


DIFFERENCES BETWEEN DISTRICT A AND DISTRICT R PUPILS IN THE DISTRIBUTION OF
OVERACHIEVERS, NORMAL ACHIEVERS AND UNDERACHIEVERS


                                           Favored    Chi       Significance
                                           Group (a)  Square    Level
  4th Grade Superior Pupils                  A         8.43      .05
  6th Grade Superior Pupils                  A        41.14      .001
  4th Grade Average Pupils                   A         3.93      NS
  6th Grade Average Pupils                   A         1.76      NS
  4th Grade Slow Pupils                      A         3.94      NS
  6th Grade Slow Pupils                      A         4.68      NS


(a) “A” indicates ability grouped pupils had a more favorable distribution of overachievers, normal achievers
and underachievers.

5. No significant differences were found between
the proportions of overachievers, underachievers,
and normal achievers among pupils of average ability
in the two districts. These results support the
working hypothesis and suggest, as might be expected,
that the two systems of grouping have little effect
on the achievement or motivation of average pupils.
In random grouping systems the curriculum
is generally aimed at the average pupil. Thus, one
of the oft-stated advantages of ability grouping,
that it fits the curriculum more accurately to the
needs of the pupil, does not apply to the average
groups.


6. Comparisons of the proportions of overachievers,
underachievers and normal achievers among
slow pupils in the two systems revealed no significant
differences. For slow pupils, the ability grouping
system is generally thought to have the advantage
of presenting the student with more suitable
curricular materials while the random grouping
system might be expected to produce greater
achievement because of the pressure put on the slow
pupil to keep up with the rest of the class. If these
factors are operating, it must be concluded that
neither is of sufficient importance to lead to significantly
different achievement patterns for the slow
pupil.


The reader is reminded that the above conclusions
are based upon achievement data collected
during the pilot study year of the Utah project and
should therefore be considered tentative until supported
by additional research evidence.


CHAPTER 5


STUDY METHODS AND ATTITUDES 


ALTHOUGH no previous research was found that 
was concerned directly with the effects of ability 
grouping and random grouping upon study habits, this 
area seemed a worthwhile one to explore. 


WORKING HYPOTHESES 


1. It was hypothesized that superior pupils in
ability grouped classrooms would obtain significantly
higher scores on the study habit measures
than would superior pupils in random grouped classrooms.
Many educators are convinced that ability
grouping tends to develop better study habits. This
belief is particularly strong with regard to superior
pupils and is based on the idea that the superior pupil
in a random grouped class is not sufficiently challenged
and is capable of doing the assigned work with
very little real effort. Thus, he tends to develop
poor study habits and attitudes. Conversely, in an
ability grouped classroom the bright student is
forced to compete with other students of equal ability
in a more challenging program, requiring him to
develop better study habits in order to maintain his
position in the class.

In view of the lack of evidence in this area and
the absence of factors in the grouping treatments
that would suggest a clear-cut superiority for either
treatment, there is little reason to expect that ability
grouping would tend to develop better study habits
among average or slow pupils. In fact, it might be
argued that the average pupil would be motivated to
study harder in the random grouped class where he
is challenged by superior pupils. With regard to the
slow pupil, the same argument can be made, but a
counter argument is also possible to the effect that
the slow pupil in the random grouped class is faced
with competition he cannot hope to meet and may
thus react with some form of rationalization that
would tend to reduce his efforts rather than increase
them.

2. It was hypothesized that there would be no significant
differences between the scores of average
and slow pupils in the two grouping treatments on
the study habits measures.


PROCEDURE 


Considerable evidence concerning study habits
was collected over the four years of the Utah study.
Pupils in Samples IV, VI, VIII, and X were administered
study habits measures at grade levels ranging
from seven to 12. Two measures were used,
the California Study Methods Survey (CSMS) and the
Survey of Study Habits and Attitudes (SSHA). The
CSMS contains 150 items and yields three subscores
and a total score. The subscores are Attitude Toward
School, Mechanics of Study, and Planning and System.
In addition to these scores, a verification key
is available. The test manual suggests a critical
score of 17 and indicates that verification scores below
this point usually mean that the student is either
attempting to fake an unduly favorable score, is
careless in his marking of the test, or has omitted
too many items. In this research, pupils below the
suggested critical score were eliminated prior to
carrying out the analysis.
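
The screening step can be illustrated with a short Python sketch. The record layout, field names, and sample values below are hypothetical and serve only to show the rule stated above: pupils whose verification score falls below the critical score of 17 are dropped before analysis.

```python
from dataclasses import dataclass

CRITICAL_VERIFICATION_SCORE = 17  # critical score suggested by the test manual


@dataclass
class CsmsRecord:
    """One pupil's CSMS result (hypothetical layout for illustration)."""
    pupil_id: str
    verification: int
    total: float


def retain_valid(records, critical=CRITICAL_VERIFICATION_SCORE):
    """Keep only records at or above the critical verification score."""
    return [r for r in records if r.verification >= critical]


# Invented example records, not data from the study.
records = [
    CsmsRecord("p1", 21, 96.0),
    CsmsRecord("p2", 16, 110.0),  # below the critical score, so eliminated
    CsmsRecord("p3", 17, 88.0),   # exactly at the critical score, so retained
]

kept = retain_valid(records)
print([r.pupil_id for r in kept])
```

Because the text says pupils below the critical score were eliminated, a score of exactly 17 is retained in this sketch.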

Unlike the CSMS, the SSHA yields a total score
only. The measure contains 75 items, about half
dealing with study methods and procedures and half
dealing with study attitudes. The test-retest reliability
of the total score is reported as .83 for boys
and .80 for girls (19), which is comparable to the
total score reliability of .86 for the CSMS (23).


RESULTS 


Sample IV 


Because of the lack of suitable study habits measures
that can be administered at the elementary level,
Sample IV pupils were not administered a study
habits test until the final year of the study when they
were in grade seven. Near the end of grade seven,
these pupils were administered the California Study
Methods Survey (CSMS). During three years of the
four devoted to this research, these pupils were at
the elementary level. Thus, the scores obtained in
grade seven provide the best evidence available in
this project on the relative effectiveness of random
grouping with enrichment versus ability grouping
with acceleration in developing study methods at the
elementary level.

As no suitable study methods measure was available
for administration to Sample IV at the beginning
of the study, one cannot be certain that initial differences
had not already developed between pupils in
the two districts prior to the beginning of the research
project. The elementary schools from which
these samples were drawn were closely comparable,
however, and both districts had employed random
grouping in self-contained classrooms prior to the
research. Thus, it seems reasonably safe to assume
that the groups were initially similar in terms of
study habits. In any event, it would be surprising
to find significant study habit differences developing
in two such similar school systems by the beginning
of grade four.

Results obtained from the seventh grade admin- 
istration of the California Study Methods Survey to 
727 Sample IV pupils may be found in Table 9. 

In looking over these results one is impressed by
the consistently higher scores of District R pupils.
Although it is possible that some of these differences
were already present at the beginning of the fourth
grade, it appears highly probable that at least part of
the differences that emerged are attributable to the
grouping treatments. District R superior pupils
made significantly higher mean scores on all three
of the CSMS subtest scores plus the total score as
compared with District A superior pupils. All differences
between average groups in the two districts
were also statistically significant favoring District
R. Among slow pupils, the Mechanics of Study mean
significantly favored the District R sample at the
.01 level, as did the total score. Differences between
the slow groups on the other two subtests were
not statistically significant.
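
The direction and size of these differences can be read off the means in Table 9. The Python sketch below simply subtracts each District A mean from the corresponding District R mean; the data structure and names are illustrative assumptions, and the values are the means as read from Table 9. It does not reproduce the significance tests, which require the pupil-level scores.

```python
SCALES = ("Attitudes Toward School", "Mechanics of Study",
          "Planning and System", "Total Score")

# Sample IV means from Table 9, in the order given by SCALES.
TABLE_9 = {
    "superior": {"A": (35.59, 40.25, 20.51, 96.37),
                 "R": (38.52, 41.86, 22.23, 102.62)},
    "average":  {"A": (31.57, 36.32, 19.22, 87.10),
                 "R": (34.43, 38.30, 20.57, 93.29)},
    "slow":     {"A": (30.33, 32.28, 19.14, 81.75),
                 "R": (31.39, 36.42, 19.74, 87.58)},
}


def r_minus_a(level):
    """District R mean minus District A mean on each scale, rounded to 2 places."""
    means = TABLE_9[level]
    return {scale: round(r - a, 2)
            for scale, a, r in zip(SCALES, means["A"], means["R"])}


for level in TABLE_9:
    print(level, r_minus_a(level))
```

Every difference is positive, that is, in favor of District R, including those for slow pupils on the two subtests where the difference did not reach significance.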

It is interesting to speculate on possible reasons
for the consistent superiority of the study habit
scores earned by District R pupils in Sample IV.
For the superior pupil, it may be that the enrichment
program in effect in District R elementary
schools provided more time for independent study
than was available in the ability grouped system
with its accelerated curriculum. In the process of
independent study the superior pupil in District R
may have developed the better study methods that
are reflected in the CSMS scores.

This explanation, however, would hardly account
for the differences in average pupils, because the
amount of time the average pupil has to devote to enrichment
materials in the heterogeneous classroom
is rather limited and the materials available are
usually aimed at the extreme groups rather than the
average. These pupils, however, may have been influenced
by another factor: the challenge presented
to them by the presence of superior pupils
may have motivated the average pupils to set higher
goals, and better study habits and attitudes may have
resulted. This same factor could also have operated
to motivate low ability pupils, although one
would expect it to be less effective for this group
because many of the goals and performance standards
set by superior pupils in the heterogeneous
class would be unattainable for students of low
ability.

One must conclude on the basis of the Sample IV
data that ability grouping of the type used in District
A does not lead to better study habits for elementary
pupils at any ability level. Instead, the data indicate
that heterogeneous grouping with enrichment
tends to develop better study habits at the elementary
level, especially among superior and average
pupils.


Sample VI 


The California Study Methods Survey was administered
to Sample VI pupils near the end of the ninth
grade. As was the case with Sample IV, no initial
study habits measure was obtained. For the same
reasons given for Sample IV, however, there seems
to be a sound basis for expecting that little or no initial
difference existed between pupils of comparable
ability in the two districts.

After eliminating incomplete cases and those who
scored less than 17 on the CSMS verification score,
a total of 673 complete cases were available for
analysis of variance. A summary of the results obtained
by comparing pupils of similar ability in District
A and District R may be found in Table 10.

In looking over the differences on the CSMS for
Sample VI pupils in the two districts, the lack of significant
differences for superior and slow pupils
provides a sharp contrast with the Sample IV results.
For both of the samples, District R average pupils
were significantly higher on all three subtests and
on the total scores. In Sample IV, District R superior
pupils received significantly higher scores on all
three subtests and on the total score, and District R
slow pupils did so on Mechanics of Study and on the
total score, while none of these differences was
significant for Sample VI.

One might wonder why similar differences were
not found for Sample VI. Perhaps the most likely
explanation is that Sample IV results for the most
part reflect changes due to differences in the grouping
treatments at the elementary school level, while
Sample VI data reflect, for the most part, the effects
of the two grouping systems at the junior high
school level. At the elementary level the grouping
treatments influence the entire school day, while at
the junior high school level, pupils were in ability
grouped classrooms for mathematics and science
only.

The lack of significant between-treatment differences
for superior and slow pupils would suggest
that at the junior high school level, enrichment is
less effective in developing study habits than at the
elementary school level. This might be expected
because the self-contained classroom would seem to
offer better opportunities for enrichment than the
departmentalized class organization employed at the
junior high school level.

Perhaps the best explanation for the consistently 
higher scores of District R average groups in both 
Sample IV and Sample VI is that average pupils in 
heterogeneous classes are challenged to a greater 
extent than average pupils in classes from which the 
superior pupils have been removed. It is doubtful 
if enrichment contributed materially to the better 
study habits of District R average pupils at either 
grade level as the enrichment program tends to
emphasize adjustment of the curriculum for the
superior and slow pupils and, as a rule, little
adjustment is provided for pupils of average ability.


TABLE 9


MEAN SCORES OF SAMPLE IV SUBGROUPS ON THE CALIFORNIA STUDY METHODS SURVEY


                                                       Means
                                    Attitudes       Mechanics   Planning     Total
Group                          N    Toward School   of Study    and System   Score

Superior
  Ability Grouped (Dist. A)    97   35.59           40.25       20.51         96.37
  Random Grouped (Dist. R)    202   38.52a          41.86b      22.23a       102.62a
Average
  Ability Grouped (Dist. A)   148   31.57           36.32       19.22         87.10
  Random Grouped (Dist. R)    157   34.43a          38.30a      20.57a        93.29a
Slow
  Ability Grouped (Dist. A)    42   30.33           32.28       19.14         81.75
  Random Grouped (Dist. R)     81   31.39           36.42a      19.74         87.58a
Total                         727


a This indicates a difference significant at the .01 level favoring the indicated group as compared with the
comparable ability level in the other district.
b This indicates a difference significant at the .05 level.


TABLE 10


MEAN SCORES OF SAMPLE VI SUBGROUPS ON THE CALIFORNIA STUDY METHODS SURVEY


                                                       Means
                                    Attitudes       Mechanics   Planning     Total
Group                          N    Toward School   of Study    and System   Score

Superior
  Ability Grouped (Dist. A)    69   35.35           43.01       20.33         98.91
  Random Grouped (Dist. R)    107   36.03           43.17       19.66         98.91
Average
  Ability Grouped (Dist. A)   226   30.21           38.12       18.13         86.37
  Random Grouped (Dist. R)    165   33.91a          40.89a      19.10b        93.90a
Slow
  Ability Grouped (Dist. A)    29   29.79           34.48       18.07         82.34
  Random Grouped (Dist. R)     77   29.43           36.70       17.93         84.06
Total                         673


a This indicates a difference significant at the .01 level favoring the indicated group as compared with the
comparable ability level in the other district.
b This indicates a difference significant at the .05 level.


Sample VIII 


The Brown-Holtzman Survey of Study Habits and
Attitudes (SSHA) was administered to Sample VIII
and Sample IX pupils. As both of these samples
were tested near the end of junior high school and
again near the end of high school, they provide evi-
dence primarily upon the relative effectiveness of
ability grouping versus random grouping at the high-
school level. The administration of the SSHA on at
least two occasions permitted adjustment for initial
differences, using covariance analysis.

As the SSHA is not designed for administration 
prior to grade nine, Sample VIII pupils were not
tested on this measure during the first year of the 
study. They were, however, tested during their 
ninth grade year and again during their eleventh 
grade year. Complete data required for the analy- 
sis of covariance were available for 428 pupils in 
Sample VIII. 
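
The adjustment of final means for initial differences can be illustrated with a minimal sketch of how covariance-adjusted means are computed. The function name `adjusted_means`, the use of a single pooled within-group slope, and the scores shown are illustrative assumptions; they are not data or computational procedures taken from the Utah study.

```python
from statistics import mean


def adjusted_means(groups):
    """Return covariance-adjusted final means for each group.

    `groups` maps a group name to a list of (initial, final) score pairs.
    Each final mean is shifted by the pooled within-group regression slope
    times the distance of the group's initial mean from the grand initial mean.
    """
    grand_initial = mean(pre for pairs in groups.values() for pre, _ in pairs)

    sum_xy = 0.0  # pooled within-group cross-products
    sum_xx = 0.0  # pooled within-group sum of squares on the initial score
    group_means = {}
    for name, pairs in groups.items():
        pre_mean = mean(pre for pre, _ in pairs)
        post_mean = mean(post for _, post in pairs)
        group_means[name] = (pre_mean, post_mean)
        sum_xy += sum((pre - pre_mean) * (post - post_mean) for pre, post in pairs)
        sum_xx += sum((pre - pre_mean) ** 2 for pre, _ in pairs)

    slope = sum_xy / sum_xx
    return {
        name: post_mean - slope * (pre_mean - grand_initial)
        for name, (pre_mean, post_mean) in group_means.items()
    }


# Hypothetical (initial, final) scores for two districts; not study data.
example = {
    "A": [(10, 12), (20, 22), (30, 32)],
    "R": [(20, 25), (30, 35), (40, 45)],
}
adjusted = adjusted_means(example)
```

With these hypothetical scores the unadjusted final means differ by 13 points (22 against 35), while the adjusted means differ by only 3 (27 against 30), because most of the raw gap reflects the higher initial scores of the second group.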

The adjusted eleventh grade means for superior 
pupils in the two districts were very similar, being 
less than one raw score point apart. Slow pupils 
were also found to be very similar. Average pupils 
appear to have fared somewhat better in District R 
as indicated by a significant difference (.01 level) 
in adjusted final means. In terms of the published 
norms, District A average pupils were at the 35th 
percentile while District R pupils were at the 50th 
percentile (19). 

We may conclude from the Sample VIII data that 
the high-school pupil of average ability develops 
better study habits and attitudes in heterogeneous 
classes, probably because of the greater challenge 
or motivation provided him by the presence of supe- 
rior pupils in his classes. On the other hand, there 
seems nothing to choose between the two grouping
systems in terms of their effectiveness in develop- 
ing better study habits in superior or slow pupils. 


Sample IX 


Sample IX pupils were studied from the beginning 
of the ninth grade through the end of the twelfth 
grade. This sample was administered the SSHA on 
three occasions during the study: near the end of 
the ninth grade, near the end of the tenth grade, and
near the end of the twelfth grade. Complete data
were available for 214 pupils. The twelfth grade 
means were adjusted for initial differences using 
analysis of covariance. 

The adjusted twelfth grade means for pupils of 
comparable ability in the two districts were very 
similar with slightly more than one raw score point 
separating the slow groups and an even smaller
amount separating the average groups and the supe- 
rior groups. None of the between-treatment differ- 
ences approached statistical significance. 

In terms of gains in study habits over the period 
from the ninth grade to the twelfth grade, superior 
pupils in both districts made reasonably large gains. 
The average groups and slow groups, however, 
gained very little, earning virtually the same scores 
on the test in the twelfth grade that they had earned 
in the ninth grade. 

The Sample IX results seem to suggest that nei- 
ther the ability grouping system nor the random 
grouping system as practiced by the districts in- 
volved in the Utah study makes any unique contribu- 
tion to the study habits of high school students. 


SUMMARY AND CONCLUSIONS


The Sample IV results on the CSMS showed all of
the total score differences and seven of the nine sub-
test differences between comparable groups in Dis-
tricts A and R to be statistically significant. Differ-
ences were generally largest between pupils of su-
perior ability. These results lead to the conclusion
that pupils in a random grouping situation consistent- 
ly develop better study methods during the elemen- 
tary school years than pupils in an ability grouping 
situation. 

Sample VI average pupils in District R received 
consistently higher scores on the CSMS than com- 
parable District A pupils. No significant between- 
treatment differences were present for slow or su- 
perior pupils. Sample VIII average pupils in Dis- 
trict R received higher SSHA scores. No significant 
between-treatment differences emerged from the
Sample IX data. It may be concluded that if any ad-
vantage accrues from heterogeneous grouping at the
secondary level, it is for average pupils, who made
significantly greater gains in study habits in Sam-
ples VI and VIII. Although this difference was not
found in Sample IX, the larger number of subjects in 
the other samples would still justify this conclusion. 




CHAPTER 6 


NON-COGNITIVE VARIABLES 


THE VAST majority of studies published in the 
area of ability grouping have been concerned with 
achievement. Yet, the critics of ability grouping 
almost never base their criticisms on achievement, 
but instead bring up the possible effects of ability 
grouping upon the non-cognitive aspects of the pu- 
pils’ school experience. They question whether chil- 
dren can experience normal social development in 
ability grouped classes. They suggest that bright 
pupils will become snobs, and slow pupils will de- 
velop inferiority feelings. They ask if the pupil’s 
self-concept will be damaged, and whether negative 
attitudes will develop. In this section, we will re- 
view the rather limited research pertinent to the 
non-cognitive effects of grouping and will then ex- 
amine the findings of the Utah study that relate to 
some of the aforementioned questions. 

The non-cognitive variables covered in this sec- 
tion of the Utah study and the samples tested with 
each are summarized in Table 11. 


PREVIOUS RESEARCH RELATING ABILITY 
GROUPING TO NON-COGNITIVE VARIABLES 


Social Aspects of Ability Grouping 


Ability grouping has probably been criticized 
more frequently in regard to its social aspects than 
it has in regard to academic achievement. Charges 
that ability grouping causes social maladjustment 
are typical of the tirades loosed in opinion articles 
by Tonsor (113), Bettelheim (8), Cavanaugh (25), and
Ginsberg (41). Lawson (70), conversely, contended
that homogeneous grouping would not cause malad-
justment in the child. He felt that there is a natural
tendency for children to gravitate toward contacts
with other children of similar intellectual levels.
When a child is with others with whom he cannot com-
pete he meets constant failure that is apt to cause
maladjustment. Goldberg (42) also felt that placing 
youngsters into narrow-range ability groups would 
not bring about poor social or personal adjustment. 
McGaughy (82), in listing opinions for and against 
ability grouping, pointed out that students with sim- 
ilar rates of speed are happier together and enjoy 
school life more, but also that children in slow
groups are marked as ‘‘dullards’’ or ‘‘dumbbells’’. 

It is readily apparent that there is no uniformity 
of opinion regarding the effect of ability grouping on 
the social development of the pupil. There is evi-
dence available indicating that the claim that stu-
dents are more likely to develop undesirable social
attitudes in ability grouped sections than in random 
grouped sections is not well founded. Studies by 
Goldberg, Passow and Lorge (46), Goldworth (47), 
Rochfort (97), and Bell (7) showed little difference 
in the social or personal attitudes of students grow- 
ing out of the two plans of grouping. Barbe (4), on 
the basis of a questionnaire mailed out to 456 stu- 
dents of special high school classes for gifted chil- 
dren, found that the majority of the respondents re- 
ported that having been members of these special 
classes aided them in making a good adjustment. 
This contradicted the usual complaint about homo- 
geneous grouping to the effect that such children 
have a more difficult time adjusting to other groups.
Such data, however, must be interpreted with cau- 
tion because of the possibility that the responding 
group represented a biased sample. 

Jackson (63), working with third and fifth grade 
pupils, reported findings generally favorable to abil- 
ity grouping. Not only were the schools bothered 
with only minor behavior problems (no mention was 
made of the extent of behavior problems earlier), 
but these schools found that in ability grouped sec- 
tions truancy almost disappeared. 

Martinson’s (78) study presents some interesting 
evidence on sociometric choices of superior pupils. 
Her experimental groups consisted of 929 boys and 
girls from every type of public school designed to 
represent the entire school population of California. 
The mean IQ of the sample, based on the revised
Stanford-Binet scale, was 142.6. The subjects were
matched by pairs with a control group on the basis
of chronological age, IQ, sex, and socioeconomic
status. The programs in which the subjects partici-
pated were planned within the general areas of en-
richment within the classroom, acceleration, and
special groupings. In these areas a total of 17 pro-
grams at various grade levels were evaluated.

It was found that first graders in the experimen-
tal programs showed highly significant gains in
friendship choices, while the control gifted subjects
showed no change, either positive or negative.
Again at the fifth and sixth grade levels, the total
experimental group of 191 subjects showed highly
significant gains in social status on the basis of re-
sponses by their peers in regular classroom situa-
tions, while their matched controls showed no
change. Apparently the gifted child’s classmates
did not regard him as a less desirable friend be-
cause special provisions were made for him. Gold-
berg (43: 51) supported these findings in her state-
ment that ‘‘There is no evidence to support the no-
tion that grouping has adverse effects on the social
or personal attitudes or behavior of children.’’


TABLE 11


SAMPLES ADMINISTERED NON-COGNITIVE MEASURES IN THE UTAH STUDY


Sample and Grade Level to Which Each Measure was Administered


                                                  Sample
Measure                                           IV

 1. Three criterion Sociometric
    Choice Measure                                4th, 5th, 6th
 2. USU School Inventory (attitudes
    toward peers, teachers, and
    school)                                       6th
 3. SRA Junior and Youth Inventories
    (Problem check lists)                         5th
 4. Index of Adjustment and Values
    (four scores related to self-concept)         5th, 6th, 7th
 5. California Test of Personality
    (three subtests used)                         oth
 6. Inventory of Factors GAMIN
    (two factors used, ascendency
    and inferiority)
 7. Objective-Analytic Personality
    Test Factors
    UI - 16 -- Competent Assertiveness            5th
    UI - 24 -- Anxiety to Achieve
 8. California Psychological Inventory
    (all 18 scores obtained)
 9. Individually administered TAT type
    measure (three scores: aggression,
    depression, and inferiority feelings)         5th (slow)
10. Group administered TAT type
    measure                                       6th


Sample Sample Sample Sample
VI VII VIII IX

6th
7th 8th 9th
7th, 8th, 9th 8th, 9th 9th, 10th 9th, 11th
7th 8th 9th
7th
8th 9th


Drews’ study (31) was concerned with the effects
of homogeneous and heterogeneous grouping on the
performance of superior, average and slow pupils
in ninth grade English classes. Her data provide
some valuable evidence on the effects of grouping up-
on sociometric status. In her study, six three-
choice sociometric questions were used. Two of
these questions required pupil choices on social var-
iables while the other four were concerned with in-
tellectual variables. Pupil choices were analyzed
by question and also composite peer choice scores
in the social and intellectual categories were cal-
culated. The pupil’s choices were limited to indi-
viduals in his English class, which places consider-
able restriction on choices in the departmentalized
junior high school situation. In the heterogeneous
classes, Drews found that for each of the six socio-
metric criteria, superior students were chosen
more frequently than average students and slow stu- 
dents were least often selected. Although marked 
differences occurred in both the social and intellec- 
tual categories, the differences, as might be expect- 
ed, were greater in the intellectual area. Drews 
also compared the distributions of sociometric 
choice scores obtained by superior, average and 
slow pupils in her homogeneous and heterogeneous 
classes. Comparisons of mean scores in the two 
types of classes would, of course, be spurious, be- 
cause in the homogeneous class the pupil has no 
choice but to nominate persons of the same ability 
level. The distribution of choice scores, however, 
can be meaningful because if a leadership vacuum
exists in the average and slow homogeneous classes 
as has sometimes been suggested, none of the aver-
age and slow pupils would receive enough sociomet-
ric choice nominations to be classified as ‘‘stars’’.
She found that although a larger percentage of supe- 
rior pupils received high numbers of nominations in 
both grouping systems, the differences between abil- 
ity levels were much greater in the heterogeneous 
classes. On the intellectual choice questions, if we 
take a sociometric score of 12 as the cut off point, 
we find that 54 percent of the superior pupils in het- 
erogeneous groups obtained more than 12 nomina- 
tions while no slow pupils in the heterogeneous 
groups reached this level. In contrast, 30 percent 
of the superior pupils in homogeneous groups re- 
ceived more than 12 nominations while 28 percent of 
homogeneously grouped slow pupils reached this 
point. On the social questions, approximately 33 
percent of the superior pupils in heterogeneous
groups received more than 12 nominations with no 
slow pupils receiving this number. In homogeneous 
groups, 30 percent of the superior pupils, and 18 
percent of the slow pupils, received 12 or more 
nominations. These data would indicate that al- 
though a smaller proportion of slow than superior 
pupils in either treatment gained positions of social 
recognition, the slow pupil had almost no chance to 
gain such status in the heterogeneous classes, while 
his chances were nearly as good as the superior pu- 
pil in the homogeneous classes. If Drews’ study 
had extended for more than one year, it seems like- 
ly that the distributions of sociometric choice scores 
in the homogeneous classes would have become even 
more similar as slow pupils gained more experience 
in positions of social leadership. Many psycholo- 
gists have stressed the importance of peer accept- 
ance during the adolescence period. In view of this, 
Drews’ evidence would seem to provide strong sup- 
port for ability grouping at the junior high school 
level. 
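
The comparison of score distributions described above amounts to computing, for each ability level, the share of pupils whose nomination count exceeds a cutoff. A minimal sketch follows. The function name `percent_stars` and the nomination counts are hypothetical and are not Drews’ data; only the cutoff of 12 nominations comes from the text.

```python
def percent_stars(nominations, cutoff=12):
    """Return, per ability level, the percentage of pupils with more than `cutoff` nominations."""
    return {
        level: 100.0 * sum(1 for n in counts if n > cutoff) / len(counts)
        for level, counts in nominations.items()
    }


# Hypothetical nomination counts for one heterogeneous class (not Drews' data).
heterogeneous = {
    "superior": [15, 13, 8, 20],
    "average": [12, 9, 14, 6],
    "slow": [2, 5, 11, 1],
}
shares = percent_stars(heterogeneous)
```

Because the result is a share within each ability level rather than a mean score, it can be compared across homogeneous and heterogeneous classes even though the pool of classmates available for nomination differs between the two.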

Mann’s research (74) concluded that there was 
little real friendship between superior and average 
or below average students in heterogeneously
grouped classrooms. In this study 67 gifted pupils
in a total sample of 280 fifth and sixth graders were
placed in regular classes for half of their work and
in special workshop rooms with other gifted children
for the other half. On the basis of sociometric 
choices, there were more acceptances and rejec- 
tions of the gifted by the gifted than by the typical 
children. Typical children, as a group, tended to 
accept and reject more typical children than work- 
shop children. These findings suggest that friend-
ships were based to a considerable degree upon abil-
ity level. It appears a safe assumption that students
will select as friends those persons with whom they 
associate more, especially at the elementary school 
level. With this in mind one might speculate as to 
whether the sociometric results of Mann’s study 
were due to similar mental ability or to the greater 
contact among the gifted children than between the 
gifted and typical groups. This increased contact 
would serve to increase the number of sociometric 
choices regardless of the basis used for separating 
the children into groups, thus casting serious doubt 
on the validity of Mann’s conclusions. 

The literature concerning the social aspects of 
ability grouping includes at least some evidence to 
support any stand one might care to take. Most of 
the research studies concluded that the grouping 
programs had no adverse effect on the social adjust- 
ment of the students. Some evidence was presented 
that indicated an improved social relationship in
ability grouped classes, while other evidence sup-
ported the critics of ability grouping by indicating
that the development of undesirable attitudes result- 
ed from such grouping programs. 


Pupil Attitudes and Grouping 


Almost no research has been reported that is di- 
rectly pertinent to pupil attitudes toward school in 
ability grouped and random grouped classrooms. 
Some data in this area were collected by Simpson 
and Martinson (105). The reader will recall that 
this study was concerned with the effectiveness of 
special programs for gifted children in California.
Rating scales were developed for use by pupils, 
teachers, and parents that dealt with a number of 
areas somewhat related to pupil attitudes. Among 
the variables covered on these questionnaires were 
included interest in school and rapport with teachers. 
These data, however, were not collected for the con- 
trol groups so that comparisons were not possible 
for most of the samples involved in the study. The 
fifth and sixth grade experimental programs, how- 
ever, included both an enrichment program and a 
special class program. The first of these programs
employed enrichment for superior pupils in the reg-
ular classroom, and the second involved a special
class for superior pupils. These two programs 
were quite similar to the random grouping system of
District R and the ability grouping system of District A in the Utah
study. Rating scale data from pupils and parents 
were collected on these two groups at the beginning 
and end of the one year study, and percentage of 
gains was calculated. The rating scale items on in-
terest in school, motivation towards learning, and
intellectual curiosity were classified together as
‘‘intellectual stimulus’’ by the investigators. Ratings
by pupils and their parents in the special class
situation indicated an average pupil gain on these
items of about 80 percent over the year of the study.
Similar ratings for the enrichment program indi- 
cated gains of about 50 percent. On another perti- 
nent variable, ‘‘relationships with the teacher’’, 
ratings by special class pupils and their parents 
showed a gain of about 45 percent as compared with
a gain of approximately 35 percent for the enrich- 
ment program. Teacher ratings were not available 
for the special class groups. As each of the rating 
scales included only a single item for each area cov-
ered, these data are subject to serious limitations
in reliability. The validity of a direct rating scale 
when used by presumably untrained raters must al- 
so be questioned. 

Goldberg and her associates reported that in their 
study of the effect of ability grouping per se, com- 
parisons between pupils in homogeneous and heter- 
ogeneous classes on attitudes towards school and at- 
titudes towards more or less able peers yielded re- 
sults that were largely negative. 6 

A study of Passow, Goldberg, and Link (89) com- 
pared four groups of gifted junior high school pupils 
in mathematics achievement from the beginning of 
the seventh to the end of the ninth grade. Two of 
these groups participated in an accelerated program,
one in an enrichment program, and there was one
control group. In addition to measuring achieve- 
ment differences, this study administered an atti- 
tude inventory at the beginning of the seventh and 
end of the ninth grade. This inventory contained 
items in six attitude areas related to mathematics. 
The groups were closely comparable on the initial 
attitude test. Analysis of the total score of the at-
titude inventory administered at the end of the ninth
grade showed the two classes in the accelerated
program to have similar means (60.2 and 63.6).
These means were significantly higher than those of
pupils in the enrichment group (53.6) and the control
group (53.7). These data suggest that acceleration
develops considerably more favorable attitudes to- 
ward mathematics than enrichment among groups of 
homogeneously grouped gifted pupils. A limitation 
of this study is that only one classroom was in- 
volved in each of the four grouping treatments. Four 
different teachers taught these classes. Thus, it is
very possible that some of the differences obtained
were due to teacher differences rather than to treat-
ment differences.

Drews’ study (31) of homogeneous and heteroge- 
neous grouping in ninth grade English classes was 
also concerned with pupil attitudes. Interest in 
school was obtained by asking pupils to check ‘‘yes’’ 
or ‘‘no’’ in answer to the question, ‘‘Is school work 
interesting to you?’’ These data were obtained at
the beginning and end of the one year study, and in-
dicate that the two grouping treatments had no effect
on the pupils’ stated interest in school. The use of
a single direct question to measure pupil interest
raises doubt about the reliability and validity of 
these data. 

Attitude towards English as a school subject was 
also obtained at the beginning and end of the research 
project. The results showed that attitudes toward 
English improved for all groups excepting the het- 
erogeneous slow group between the pre-testing and 
post-testing. Four questions were asked of pupils 
in the two treatments to see if differences had de- 
veloped between the attitudes of pupils in the homo- 
geneous versus the heterogeneous English classes. 
The results reflected about the same attitudes for 
average pupils in the two treatments, but for supe- 
rior and slow pupils the homogeneous classes elic- 
ited much more favorable attitudes. 


Self-Concept and Ability Grouping 


Only four studies have been reported in the lit- 
erature that are concerned with the relationship be- 
tween ability grouping and self-concept. The first 
such study was carried out by Mann (75) with a sam-
ple of 102 fifth grade pupils who had been grouped
into four ability levels since entrance into the first
grade. Grouping was based upon the results of
group intelligence tests and reading readiness tests.
Sections were labeled only by the teachers’ names
but teachers informally referred to them as ‘‘high-
est’’, ‘‘lowest’’, ‘‘second highest’’, and ‘‘second
lowest’’ groups. Self-concept data used in this study
were based on the subjects’ anonymous responses to
two questions. One of these was, ‘‘Tell me which
fifth grade you are in’’, and the other was, ‘‘Tell me
how you happen to be in this particular fifth grade
group rather than some other group.’’ About two-
thirds of the lowest and highest groups identified
their groups by ability level. A great majority of
the two center groups identified their groups by the
teacher’s name. In response to being asked why
they were in a particular group, 25 of the 30 pupils
in the highest group answered in positive terms rel-
ative to ability or achievement such as ‘‘I’m smart’’,
‘‘I make good grades’’, ‘‘We work a little faster’’,
and so on. In the lowest group, 14 of the 18 pupils
answered in negative terms relative to ability or
achievement such as ‘‘I am too dumb’’, ‘‘I am not so
smart’’, ‘‘We don’t know too much.’’ Mann’s study
clearly indicates that pupils in an ability grouped
situation are aware of the nature of their situation,
but presents no direct evidence on the effect of group-
ing on the student. No control group was employed.
It is possible that use of a control group would have
revealed that low and high ability pupils in random
groups have similar low and high perceptions of
their ability and achievement. Furthermore, a ques-
tionnaire based on only two items would be expected
to have rather low reliability. Mann’s study would
have been improved considerably by the use of a 
more extensive measure of self-concept. Other 
studies such as the work of Luchins and Luchins 
(72), have also suggested that pupils in an ability 
grouped situation are aware of their grouping status 
regardless of efforts made by the school to conceal
the grouping level. This awareness, coupled with 
the associations that the pupil makes with his ability 
level as indicated by Mann, certainly suggests that 
ability grouping may have undesirable effects upon 
the pupil’s self-concept. 

Drews’ research (31) with ability grouped and 
random grouped classes in ninth grade English also 
included measures of self-concept. Drews hypothe- 
sized that slow pupils would see themselves as rel- 
atively more competent in a homogeneous situation 
while the reverse would be true of superior pupils. 
Two self-concept measures were employed to test 
this hypothesis. The Ability Self-Concept Rating 
consisted of a single question asking the student to 
compare his ability with that of his classmates and 
rate himself as above average, about average, or 
below average. The other measure used was the
20-item Concept of Self-as-a-Learner Scale which
Drews developed from Bills’ Index of Adjustment 
and Values. 

The Ability Self-Concept Rating was adminis- 
tered at the beginning and end of the experimental 
year. Mean scores were analyzed using the t-test 
to determine if significant differences existed be- 
tween homogeneous and heterogeneous classes. On- 
ly one significant difference was found on the pre- 
test analysis — favoring slow pupils in the homoge- 
neous sample. At the end of the one year experi- 
ment, between-treatment comparisons revealed that 
superior pupils in heterogeneous groups and slow 
pupils in homogeneous groups made significantly 
higher scores on this measure. 

Drews administered the Concept of Self-as-a-
Learner Scale at the end of the study only. She
found that although the heterogeneously grouped su- 
perior pupils obtained higher mean scores, the dif- 
ferences were not significant. Only one significant 
between-treatment difference occurred, favoring 
the slow boys in the homogeneous groups. 

An important study by Goldberg and her associ- 
ates has not been completed at the time of this writ- 
ing. She reports, however, that self-attitudes were 
significantly affected by grouping.


In narrow-range groups the gifted children
showed a significant decrease in their self-ap-
praisals which were considerably inflated to
begin with. On the other hand, the slow chil-
dren in narrow-range groups showed in-
creased self-appraisals which brought them
more nearly to the level of the average. The
gifted showed considerable increases in self-
appraisal while the slow decreased signifi-
cantly. Depending upon your point of view,
effects of grouping on self-attitudes may be
perceived as desirable or undesirable. The
effects of grouping were even more start-
ling with regard to self-aspirations. There,
the broad range group, and especially the 
presence of gifted children tended to act as 
a damper on the aspirational ceilings of the 
slower pupils. @ 


A portion of the California study of gifted children 
(105) showed that eighth grade pupils in special 
classes made greater gains on Self-Acceptance as
measured by the California Psychological Inventory 
than did pupils of comparable IQ, who were assigned 
to heterogeneous control classes. These findings 
agree with those reported by Goldberg but disagree 
with Drews (31). Perhaps these findings differ be- 
cause Drews’ study made special curriculum adjust- 
ments for the heterogeneous groups while the other 
two studies did not. 


Personality and Ability Grouping 


The controversy over ability grouping’s effect up-
on personality is to a large extent based on opinion
and philosophical orientation. A variety of behav- 
ioral scientists including Bettelheim (8), Hamalainen 
(56), Tonsor (113), and Gowan (50) have stated their 
opinions of the effects of ability grouping on the per-
sonality of the pupil. Most educators also have
strong feelings on this topic, but when we look for 
research evidence there is very little. Let us now 
review the limited evidence available on the effects 
of ability grouping on personality. 

In an individual interview study, Luchins and Lu-
chins (72) questioned 190 fourth, fifth and sixth
grade children about their attitudes toward the abil- 
ity grouping practiced in their school. Every pupil 
was aware of the significance of the superscript, 
‘*1”? (bright), ‘‘2’’ (dull), ‘‘3’’ (average) attached to 
his class. Practically all students in the ‘‘1’’ class 
preferred their class and would choose to remain in 
it even if they did not like the teacher. Among the 
reasons for choosing the ‘‘1’’ class were: they be- 


‘ longed there, they would be bored, they did not want 


to mingle with or be educated with ‘‘dumbbells’’. 
When asked which class they would rather be in, 
children in the average and dull classes chose ‘‘1”’ 
class because there would be stigma attached to be- 
ing in ‘‘2’’ class which ‘‘has a reputation you can’t 
get rid of’’, their parents would be proud of them, 
the teacher would not be so strict, they would have 
more fun, they would learn more, andbesides it 
paid to take the chance since they could always go 
back to ‘‘2’’ class. The few students in the bright 
and average classes who selected the ‘‘2’’ class did 
so because they might get better grades or they could 
be more outstanding. Those in the ‘‘2’’ class who 
selected to remain said it was best for them, they
would be out of place in ‘‘1’’ class, and they wouldn’t
be allowed in ‘‘1’’ class anyway, so why should they 
pick it? Luchins and Luchins concluded that the 
prestige value of ‘‘1’’ class membership was its 
chief attraction, that it caused the able members to 
become snobbish, that the grouping situation in the 
school caused stigma to be attached to the ‘‘2’’ class
membership, and that homogeneous grouping creat- 
ed a caste system in the school. Since the subjects 
in Luchins and Luchins’ study were not compared
with any controls, it is impossible to conclude defi- 
nitely that the findings were attributable to the group- 
ing situation, although this appears to be the case. 
Students’ feelings, like those reported in Luchins 
and Luchins’ study, appear to be evident to school 
administrators. A survey by Saville (103) revealed 
that a number of school officials considered the 
‘‘stigma’’ attached to pupils involved in ability
grouping to be a major problem. 

Mathias (81), using the Rorschach and TAT to 
measure aggression and depression, found that the 
slow and bright children had significantly more aggression
responses (beyond the .01 level of significance)
than average pupils in heterogeneous classes.
He concluded this was due to frustration they experi- 
enced in a school program aimed primarily at the 
average pupil. 

Simpson and Martinson (105) administered the
California Psychological Inventory to their special
class groups and control groups at the eighth-grade
and high-school levels. Subjects involved in
the eighth-grade phase of the Simpson and Martinson
study included 58 boys and 57 girls in the special
classes and 36 boys and 20 girls in the control
group. It will be recalled that this study was concerned
with superior pupils only. The special class
program was similar to ability grouping except that
it involved superior pupils only. The CPI was administered
to the special class and control groups
at the beginning and end of the one-year study and
gains in the subtest scores were tested for significance.

The special class groups made significant gains
in 19 instances and significant losses in three instances
on the CPI. The control group pupils made
significant gains in nine instances and significant
losses in eight instances on the CPI.

This study did not present direct statistical com- 
parisons to determine whether the experimental 
groups made significantly greater gains than the 
control groups. Examination of the eighth-grade 
data available for boys would indicate, however, 
that the special class group probably made signifi- 
cantly greater gains on Self-Acceptance than the 
control group. Similar comparisons for the eighth- 
grade girls appear to indicate that the special class 
girls made a significantly greater relative gain in 
Self-Acceptance and Flexibility than the control 
group. 

The data available for the high-school boys in the
special class and control groups would indicate that
the special class sample made a significantly greater
gain in Social Presence and Tolerance.

High-school girls in the control group appear to
have made significantly greater gains than the special
class group in Social Presence.


SOCIOMETRIC CHOICE


PEOPLE ARE an important part of the environment
to which individuals react in the processes of
developing and learning. Wrightstone, Justman and
Robbins (124: 199) have asserted that ‘‘The development
of one’s personality is a product of the interaction
of the individual with other people.’’ According
to Otto (88) it is in these person-to-person relationships
in various sizes and types of groups that
many social needs are met and personality development
has its roots. The need to belong to a group is
a powerful force. Some psychologists claim that
this need is an innate human need while others
maintain that it is culturally developed. All agree
that, whatever its origin, the need to belong furnishes
strong motivation for behavior, for in our society
almost every individual attempts to become a member
of some group. Cunningham, et al. (28: 108)
state that ‘‘It is important for the development
of most boys and girls of school age that they find
belonging in groups of their peers.’’ Dineen and
Gary (30: 358) express their agreement with this
concept in maintaining that:


Chief among the factors necessary for de- 
veloping a full life in a heterogeneous group 
such as a class of children is a satisfactory 
friendship pattern. When a child is with
others who accept him and respond to him, 
that is, others with whom he wants to as- 
sociate, he can contribute more and func- 
tion better in the group. 


Social adjustment has become of major importance
in education. The alert teacher is aware of
the need of children to establish and maintain emotional
and social stability. The relationship which
exists between social adjustment and academic
achievement has been shown in several recent studies
(22, 112, 24, 102). In ability grouping, the matter
of social relationships and adjustments has been
of primary concern to critics. One of the criticisms
most often directed at ability grouping in the public
schools is that the children will suffer social maladjustment
as a result of being placed in a classroom
limited to fellow students of similar intellectual
caliber. Such a system results in an artificial
social situation that stigmatizes slow students, according
to Hamalainen (56). On the other hand,
many of those who advocate ability grouping claim
that the slow child has an opportunity to be recognized
socially in such groups. He is denied such
recognition in randomly grouped classes where he
must compete with more able students.

The primary purpose of this phase of the Utah 
study was to investigate the effects of homogeneous 
versus heterogeneous grouping practices on the sociometric
structure of elementary school classrooms.
This represented an attempt to determine
if systematic differences exist which may be attrib- 
utable to the type of grouping being practiced. 

Sociometric status is defined as the total number
of times a pupil was selected on a sociometric
choice measure by his classmates as a preferred
companion on any one of the three specified activities
or situations used as criteria. Mobility of sociometric
status may be defined as movement from
one sociometric classification to another in consecutive
years. Mobility of status over a three year
period was studied in this project in an attempt to
determine whether the social relationships measured
were transitory or relatively stable. Extending the
study over a three year period also promised to give
some indication of the degree to which increasing
exposure to the two grouping treatments and increasing
age and maturity, with its accompanying
changes in personality characteristics and interests,
might affect one’s social relationships. Are status
and acceptance based on stable personality traits, or
as Staker (107) has suggested, are there different
traits that are important at different age levels?

A ‘‘star’’ is defined as an individual who receives 
a large number of choices on a sociometric test, 
more than could be expected by chance alone. One 
who receives fewer choices than could be expected 
by chance, is known as a ‘‘neglectee’’, while those 
falling between stars and the neglectees are ‘‘regulars’’.
An ‘‘isolate’’ is one who receives no
choices, while a ‘‘rejectee’’ is one who receives 
negative choices, if negative choices have been ob- 
tained. 

Gronlund and Whitney (53) have suggested that 
social acceptance pervades all aspects of one’s life 
sphere. This would mean that those factors that 
contribute to one’s acceptance by others operate at 
home, on the job, at school, in informal contacts, 
or in any situation in which a person may become 
involved. This idea was supported by Symonds and 
Jackson (110). 

The design of the Utah study provided for the investigation
of the effects of three different conditions
on measures of sociometric status. These
conditions were 1) the grouping treatment (ability
grouped or random grouped), 2) ability level (based
on academic achievement), and 3) sex. Comparisons
were made between the two districts, among the
various ability levels, and between the sexes on the
aforementioned dependent variables.


WORKING HYPOTHESES 


The following working hypotheses were estab- 
lished regarding the effects of grouping treatment, 
ability level, and sex on the sociometric variables. 

1. It was hypothesized that between-treatment 
comparisons would reveal fewer stars among Dis- 
trict A superior pupils but greater numbers of stars 
among both average and slow pupils in District A. 

2. Furthermore, it was hypothesized that the 
combined distribution of pupils at all ability levels 
into the three sociometric categories of ‘‘stars’’, 
‘‘regulars’’ and ‘‘neglectee-isolates’’, would not be 
significantly different for the two districts. 

A number of previous studies have shown rela- 
tionships between sociometric choice scores and 
achievement or ability in heterogeneous classrooms. 
When one considers the ability grouping situation, 
however, where achievement variability is somewhat
reduced, it appears probable that the effect
upon superior pupils would be to reduce their chances 
of gaining social recognition. Two changes in the 
classroom social situation seem possible for aver- 
age and slow pupils when placed in an ability 
grouped situation. One of the possible effects of 
ability grouping could be the development of a lead- 
ership vacuum in the average and slow groups re- 
sulting in few if any recognized social leaders or 
stars in these groups. The second alternative 
would be that, with the superior pupils removed, 
pupils in average and slow groups who had not pre- 
viously been recognized in the peer structure would 
emerge as social leaders or stars. If the first of 
the two aforementioned alternatives occurred, the 
overall proportion of stars would be considerably 
less in District A than District R. On the other
hand, if the latter situation occurred, the number
of stars in District A would be equal or perhaps
greater than in District R, as fewer superior pupils
would probably attain the status of stars in the abil- 
ity grouped class but greater numbers of both aver- 
age and slow pupils would reach this level of accept- 
ance. Working hypotheses concerned with socio- 
metric choice patterns in the two districts involved 
in the Utah study were based upon the latter expec- 
tation. 

3. Regarding level differences within each treatment,
it was hypothesized that in the random grouped
situation, where previous studies have demonstrated
a relationship between status and ability, superior
pupils would obtain the more favorable sociometric
status patterns (more stars and fewer neglectee-isolates)
followed by average and slow pupils in that
order. In District A, where ability differences within
the classroom were somewhat less, it was hypothesized
that sociometric choice patterns for pupils
at the three ability levels would not be significantly
different. That is, superior, average, and
slow pupils would all have about the same distribution
of stars, regulars, and neglectee-isolates.

4. It was hypothesized that no significant sex 
differences would occur in the sociometric choice 
patterns or mobility patterns of boys and girls of 
comparable ability. 

The degree to which pupils in the two treatments 
changed from one to another sociometric classifica- 
tion from year to year appeared likely to provide 
some valuable insights into the relative effect of the 
two grouping treatments upon the classroom social 
structure. 

5. It was hypothesized that the overall patterns
for average and slow pupils would tend to improve
from year to year in the District A classrooms
while staying about the same in District R classrooms.
This hypothesis was based on the assumption
that many average and slow pupils would require
a certain amount of learning and experience
before they would be able to move out of the background
and assume positions of social leadership in
ability grouped classes. Thus, although it might be
expected that some average and slow pupils would
move into positions of social leadership very quickly,
it seemed likely that many who had potential for
social leadership would require a longer experience
before emerging as stars. This lag could result in
a temporary leadership vacuum in average and slow
sections in District A.

A final goal of this phase of the project was to de- 
termine whether significant relationships existed 
between sociometric status and a number of other 
variables for which data were collected in other 
phases of the Utah study. 

6. It was hypothesized that sociometric status 
would correlate positively with ability and achieve- 
ment for District R pupils but that these variables 
would not be significantly correlated for District A 
pupils. 

7. It was hypothesized that a positive relation- 
ship would exist in both districts between sociomet- 
ric status and favorable attitudes toward the school 
environment. 

8. Positive relationships were also hypothesized
between sociometric status and self-concept variables
as measured by the Index of Adjustment and
Values.


PROCEDURE 


Sample


Sample VI pupils were administered the sociometric
measures only once, in the sixth grade,
and therefore this sample was used only in the study
of sociometric patterns. Sample IV pupils were
measured in the fourth, fifth, and sixth grades, and
data from this sample were used to study sociometric
status patterns, mobility of sociometric
status, and relationships between sociometric
status and other variables. Over the three years
that sociometric data were collected on Sample IV,
the number of pupils in District R ranged from 496
to 702, and in District A from 292 to 455. There
were 537 Sample VI pupils measured in District R
and 425 in District A during the single year that sociometric
data were collected on this sample.


Measures Used 


A near-sociometric measure of the usual partial-rank-order
type was developed for use in this study.
Three criteria were employed, each with five positive
choices. Each subject was asked to indicate by
placing a check mark beside names on a roster of 
his classmates those whom he preferred under each 
criterion. The three criteria were: 1) the five chil- 
dren in the class who are your best friends; 2) the 
five children with whom you prefer to study; and 
3) the five children you would most like to have with 
you if you were transferred to another classroom. 
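
The tallying implied by this three-criterion, five-choice measure might be sketched as follows. This is an illustration only: the pupil names and ballots are hypothetical, the function name `tally` is our own, and the handling of self-nominations and duplicate check marks is an assumption not described in the study.

```python
from collections import Counter

# The three criteria named in the text, each allowing up to five choices.
CRITERIA = ("best friends", "prefer to study with", "transfer with")
MAX_CHOICES = 5


def tally(ballots):
    """Count the choices each pupil receives across all three criteria.

    ballots maps chooser -> {criterion: [names checked on the roster]}.
    Assumptions (not from the study): a pupil cannot choose himself, and a
    name checked twice under one criterion counts once.
    """
    received = Counter()
    for chooser, by_criterion in ballots.items():
        for criterion in CRITERIA:
            names = set(by_criterion.get(criterion, []))
            if len(names) > MAX_CHOICES:
                raise ValueError(f"{chooser} made more than {MAX_CHOICES} choices")
            for name in names:
                if name != chooser:
                    received[name] += 1
    return received


# Hypothetical three-pupil class (a real roster would be much larger).
ballots = {
    "Ann": {"best friends": ["Bob", "Cal"],
            "prefer to study with": ["Bob"],
            "transfer with": ["Cal"]},
    "Bob": {"best friends": ["Ann"],
            "prefer to study with": ["Ann", "Cal"],
            "transfer with": ["Ann"]},
    "Cal": {"best friends": ["Bob"],
            "prefer to study with": ["Bob"],
            "transfer with": ["Bob"]},
}
received = tally(ballots)
```

The resulting counts correspond to the raw material from which each pupil's sociometric status score is formed.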

Measures with which the sociometric data were
compared included the Bills Index of Adjustment
and Values, which was used to measure the self-concepts
held by the students; the Sequential Tests
of Educational Progress, which were used to obtain
a measure of achievement; the California Test of
Mental Maturity; and the USU School Inventory, a
locally developed attitude scale measuring attitudes
of students toward school, teachers, and
peers.

Each pupil’s sociometric status score was ob- 
tained by summing the choices he received on the 
three criteria. After obtaining a sociometric choice 
score for each pupil, the pupils were classified as 
stars, regulars, neglectees, or isolates based up- 
on choice limits established by Bronfenbrenner (17). 
Category limits using this system were as follows: 


‘‘stars’’ -------- received 22 or more nominations
‘‘regulars’’ ----- received 10 to 21 nominations
‘‘neglectees’’ --- received 1 to 9 nominations
‘‘isolates’’ ----- received no nominations


For purposes of the analysis, however, neglectees 
and isolates were combined. An extremely small 
number of isolates were obtained in the three-cri- 
terion five-choice sociometric measure. 
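
The category limits listed above can be expressed as a small classification rule. The sketch below is illustrative: the cut-offs are those stated in the text, while the function name `classify`, the `combine_low` switch, and the sample scores are hypothetical.

```python
from collections import Counter

# Category limits as stated in the text.
STAR_MIN = 22      # 22 or more nominations
REGULAR_MIN = 10   # 10 to 21 nominations


def classify(score, combine_low=True):
    """Return the sociometric category for a total choice score.

    With combine_low=True, neglectees (1 to 9) and isolates (0) are merged,
    as was done for purposes of the analysis.
    """
    if score < 0:
        raise ValueError("a choice score cannot be negative")
    if score >= STAR_MIN:
        return "star"
    if score >= REGULAR_MIN:
        return "regular"
    if combine_low:
        return "neglectee-isolate"
    return "neglectee" if score >= 1 else "isolate"


# Hypothetical status scores (choices received summed over three criteria).
scores = {"P1": 24, "P2": 12, "P3": 3, "P4": 0}
labels = {pupil: classify(score) for pupil, score in scores.items()}
distribution = Counter(labels.values())
```

A tally such as `distribution` for each sub-group is the kind of frequency count that enters the contingency tables used in the analysis.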

Scores on sociometric status were obtained in 
this manner for each of the three years. Data that 
were used to relate sociometric scores to mental 
ability, academic achievement, attitudes, and con- 
cept of self were gathered during the third year of 
the study when the subjects were in the sixth grade. 


Method of Analysis 


The study was concerned primarily with the effects
of the two grouping treatments, ability level,
and sex on sociometric status. The various combinations
of these three variables (two treatments,
three ability levels, and two sexes) provide 12 groups
that could be compared on each dependent variable.
These comparisons were tested for significance using
either single classification analysis of variance
or chi-square contingency tables.
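
A chi-square comparison of the kind described here might be sketched as follows. The counts are hypothetical and are not data from the study; the function names are our own. The shortcut used for the probability holds only for a table with two degrees of freedom, as in a comparison of two groups across the three sociometric categories.

```python
import math
from itertools import product

# The 12 groups: 2 treatments x 3 ability levels x 2 sexes.
groups = list(product(("ability grouped", "random grouped"),
                      ("superior", "average", "slow"),
                      ("boys", "girls")))


def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for an r x c table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


def p_value_df2(stat):
    """Upper-tail probability for chi-square with exactly 2 df: exp(-x/2)."""
    return math.exp(-stat / 2.0)


# Hypothetical counts. Rows: District A, District R.
# Columns: stars, regulars, neglectee-isolates.
observed = [[30, 90, 30],
            [50, 80, 20]]
stat, df = chi_square(observed)
p = p_value_df2(stat)
```

With these invented counts the statistic is about 7.59 on 2 degrees of freedom, which falls between the .05 and .01 critical values.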


RESULTS 
Between-Treatment Differences


It will be recalled that two possible effects of
ability grouping on sociometric choice patterns were
discussed in the working hypotheses. One of these
was that ability grouping would create a leadership
vacuum in ability grouped sections made up of average
and slow pupils, thus reducing the overall proportion
of stars in District A. In order to check
this possibility, the overall distributions of stars,
regulars, and neglectee-isolates for Districts A and
R were compared. Four overall comparisons were
made, one for each year that Sample IV pupils were
tested, and one for Sample VI pupils. Only one of
these comparisons showed a significant difference
between the ability grouped and random grouped subjects.
This difference, between Sample IV pupils in
Districts A and R during the first year of study,
revealed a smaller number of stars and a greater number
of regular pupils in District A than would occur
by chance. These results suggest that during the
first year of the ability grouping situation, some
leadership vacuum was present in average and slow
groups. A similar trend in the overall sociometric
choice pattern also occurred during the first year of
the study for Sample VI. The trend, however, did
not reach statistical significance for this sample
and suggests that sixth-grade pupils adapt more rapidly
to the ability grouped situation than do fourth-grade
pupils (see Table 12). No such trend, however,
existed for Sample IV pupils during the second
and third years of the study. Overall comparisons
for these years indicate a slightly larger number
of stars in District A than in District R. These
data would suggest that once the child has adapted to
his new role in the ability grouped class, the so-called
leadership vacuum quickly disappears, and
pupils of average and low ability emerge as stars.

In addition to these overall between-treatment
comparisons, the sociometric choice patterns of 
boys and girls at each ability level in District A 
were compared to those of their counterparts in Dis- 
trict R. The results of these comparisons for the 
three years of the study are shown in Table 12. The 
reader will recall that our working hypothesis pre- 
dicted a smaller number of stars among superior 
pupils in District A. This trend was, in fact, evi- 
dent in the eight comparisons between superior pu- 
pils in the two districts. It will be noted that three 
of these differences reached statistical significance 
and one additional approached significance. The 
three non-significant differences also followed the 
same trend. In an ability grouping situation, superior
pupils, being placed together, must compete
among themselves for a smaller number of positions
of social leadership than would be available to them
in random grouped classrooms. It is noteworthy
that, although ability grouping reduced the superior
pupil’s chances of reaching the star classification,
it did not increase the likelihood of his being classified
as a neglectee-isolate.


TABLE 12

DIFFERENCES BETWEEN DIFFERENT SUB-GROUPS IN THE FOUR REPLICATIONS
BASED UPON CHI-SQUARE CONTINGENCY TABLES

                                    1958-59     1959-60     1960-61     1958-59
                                    4th Grade   5th Grade   6th Grade   6th Grade
Groupsᵇ                             Sample IV   Sample IV   Sample IV   Sample VI

Between-Treatment Differences

Overall differences (purpose: to check overall proportion of
sociometric classification between groups):

1. All subjects in District R
   versus all subjects in
   District A                       6.59ᵃ       N.S.        N.S.        N.S.

Comparisons between comparable sub-groups in terms of sex and
ability in District A versus District R (6 comparisons):

2. Superior Boys, R versus A        N.S.        N.S.        N.S.        6.52
3. Superior Girls, R versus A       8.56        A.S.        N.S.        6.04
4. Average Boys, A versus R         A.S.ᶜ       8.53        N.S.        A.S.
5. Average Girls, A versus R        N.S.        N.S.        A.S.        N.S.
6. Slow Boys, A versus R            N.S.        12.87       A.S.        N.S.
7. Slow Girls, A versus R           N.S.        9.25        N.S.        7.38

ᵃChi-square of 5.99 significant at .05 level; 9.21 significant at .01 level;
 13.8 significant at .001 level.
ᵇGroup having more desirable distribution of sociometric classifications
 listed first when significant differences are present.
ᶜA.S. indicates that difference approached significance (significance level
 between .05 and .10).
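
The significance labels used in Table 12 can be reproduced from the critical values given in its footnote. The sketch below assumes two degrees of freedom, which is consistent with the footnoted critical values (5.99, 9.21, 13.8) but is not stated explicitly in the text; the .10 cut-off for "approached significance" is derived under that assumption, and the function name `label` is our own.

```python
import math

# Critical values from the footnote to Table 12, largest first.
CRITICAL = [(13.8, ".001"), (9.21, ".01"), (5.99, ".05")]

# Assumption: 2 degrees of freedom, for which the upper-tail probability
# is exp(-x/2); the .10 critical value is then -2 ln(0.10), about 4.61.
APPROACH = -2.0 * math.log(0.10)


def label(chi_sq):
    """Return the Table 12 style label for a chi-square value (2 df assumed)."""
    for cutoff, level in CRITICAL:
        if chi_sq >= cutoff:
            return f"significant at {level}"
    if chi_sq >= APPROACH:
        return "A.S."   # approached significance (.05 to .10)
    return "N.S."       # not significant


# Values reported in Table 12.
reported = {
    "All subjects, 4th grade, Sample IV": 6.59,
    "Superior girls, 4th grade, Sample IV": 8.56,
    "Slow boys, 5th grade, Sample IV": 12.87,
    "Slow girls, 5th grade, Sample IV": 9.25,
}
labels = {name: label(value) for name, value in reported.items()}
```

Under these assumptions the two fifth-grade comparisons for slow pupils exceed the .01 critical value, while the overall fourth-grade comparison exceeds only the .05 value.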

It was hypothesized that larger numbers of stars 
would be present among average pupils in ability 
grouped classrooms because of the positions of soc- 
ial leadership opened to these pupils by removal of 
the superior pupils from the classroom. It will be 
noted in Table 12 that one of the eight between-treat- 
ment differences for average pupils was statistical- 
ly significant and two others approached signifi- 
cance. These differences plus the remaining four 
non-significant differences were all in the hypothe- 
sized direction. 

It was also hypothesized that a greater number of
stars would emerge among slow pupils in District A
because of the removal of superior and average pupils
who might be expected to dominate the social
situation in a heterogeneous classroom. It will be
noted in Table 12 that three of the eight comparisons
between slow pupils in the two districts were statistically
significant while a fourth approached significance.
Examination of the eight chi-square contingency
tables for slow pupils revealed that significant
differences in all cases were due to a greater
number of stars and a smaller number of neglectee-isolates
in District A as compared with District R.
Contingency tables not yielding statistically significant
chi-squares were also examined and it was
found that the above trend also occurred in these
tables. It is noteworthy that between-treatment differences
between Sample IV slow pupils during the
first year were not significant for either boys or
girls, again suggesting that slow pupils require a
certain amount of time for adjustment before they
emerge as leaders in the ability grouped situation.
The total pattern revealed in the between-treatment
comparisons is strikingly consistent. This
pattern seems to indicate that average and slow pupils
generally have a far better opportunity to attain
positions of social leadership and are far less likely
to be classified as neglectee-isolates in an ability
grouped situation than in a random grouped situation.
Superior pupils generally have less chance of
emerging as stars in the ability grouped classroom.
Examination of the contingency tables indicates, however,
that superior pupils do not more frequently
become neglectee-isolates in the ability grouped
classroom.

Part of the aforementioned differences are prob- 
ably an artifact of the ability grouping situation, 
that is, that pupils in ability grouped classrooms 
have no one to choose except their classmates at the 
same ability level. It should be noted, however, 
that this situation would not cause slow or average 
pupils to emerge as stars, unless they had in fact 
gained the level of social recognition necessary for 
star status. The ability grouped situation could 
have produced a vacuum in social leadership in 
which no one would have emerged as a star. The
fact that such a vacuum did not exist (except in the 
fourth grade) indicates that slow and average pupils 
made real gains in social leadership in the ability 
grouped classes. 


Level Differences 


It was hypothesized that no significant differences
between District A pupils at different ability
levels would be found. This hypothesis was
based on the assumption that in the ability grouped
situation pupils have about an equal chance of attaining
favorable sociometric positions regardless of
the ability level of the group in which they have been
placed. It was thought that some differences between
superior and slow pupils might emerge in District A
during the first year of the study because of
the possibility that slow pupils would be unable to
exploit immediately the better opportunities for social
status available to them in ability grouped classes.
Of the 24 comparisons between pupils of different
ability levels in District A, only one reached
statistical significance at the .05 level. The significant
difference was between the sociometric patterns
of superior and slow boys and reflected more
favorable patterns for superior boys. This difference
again suggests that slow pupils are not capable
of filling immediately the leadership vacuum that is
created when ability grouped classes are established.
These data also seem to indicate in a remarkably
consistent manner that in the ability
grouped situation pupils have an equal opportunity to
attain a favored sociometric position regardless of
their ability level.

It was hypothesized that significant differences
would occur between pupils of different ability levels
in District R. This hypothesis was based largely
on previous research indicating that in the heterogeneous
class the superior pupil has a greater
chance of gaining peer group recognition than his
less capable contemporaries. It was also hypothesized
that in comparisons between average and slow
pupils, average pupils would have the more favorable
sociometric choice pattern but that these differences
would generally be smaller than those involving
superior pupils. As superior pupils in the heterogeneous
classrooms tend to monopolize positions
of social leadership, the lack of opportunity for both
average and slow pupils to emerge as stars would
reduce the differences between these two groups. A
total of 24 chi-square comparisons were made between
District R pupils of different ability levels.
The eight comparisons between superior and slow
pupils were all statistically significant beyond the
.01 level with seven of the eight significant beyond
the .001 level. Of the eight comparisons between
superior and average pupils in District R, three
were significant beyond the .001 level, three were
significant beyond the .01 level, one approached significance,
and one was non-significant. The two
differences that did not reach significance were consistent
in direction with the significant differences,
favoring the superior pupils. These data seem to present
overwhelming evidence that the superior pupil is in
a highly favored sociometric position in the random
grouped classroom as compared with average or
slow pupils.

Of the eight comparisons between average and slow 
pupils, only two were statistically significant. Both 
the significant and non-significant differences gen- 
erally favored average pupils. These data reflect 
a much larger gap between superior pupils on one 
hand and average and slow pupils on the other hand, 
than is found between the average and slow groups 
in terms of their success in obtaining favorable soc- 
iometric classifications in the random grouped class- 
room. 


Sex Differences 


Comparisons were also made between the distributions
of stars, regulars, and neglectee-isolates
for boys and girls of comparable ability in each district.
It was hypothesized that no significant sex
difference would emerge. For District A, none of 
the 12 between-sex comparisons yielded significant 
chi-square values. For District R, two of the be- 
tween-sex comparisons reached statistical signifi- 
cance. Both of these were between superior boys 
and girls, and in both cases, the girls obtained the 
more favorable distribution of sociometric choices. 


Mobility in Sociometric Status 


In order to determine mobility of sociometric
choice status over the three year period of study,
the sociometric classification (star, regular, or
neglectee-isolate) was determined for each pupil
during each of the three years of the study. Pupils
were then classified into three groups: those moving
to a higher sociometric category, those moving
to a lower sociometric category, and those staying
the same from one year to the next. Comparisons
were made between the first and second years,
second and third years, and first and third years.
After mobility data had been obtained for each subgroup,
comparisons between the patterns of change
in sociometric classification between different subgroups
were made. Chi-square contingency tables
were used to determine the direction and level of
significance of between-treatment, level, and sex
differences in sociometric choice mobility.


TABLE 13

BETWEEN-TREATMENT DIFFERENCES IN SOCIOMETRIC STATUS MOBILITY AT EACH ABILITY LEVEL

                                                Significant Chi-Square Values
                                           Year 1-2      Year 2-3      Year 1-3
                                           1958-59       1959-60       1958-59
Group                                      1959-60       1960-61       1960-61

Superior pupils                            [illegible]   N.S.          N.S.
Average pupils                             [illegible]   N.S.          N.S.
Slow pupils                                [illegible]   N.S.          A-10.48
Pupils moving downward in ability level    [illegible]   N.S.          N.S.
Pupils moving upward in ability level      [illegible]   N.S.          R-10.43

ᵃIndicates that District A sample made a greater proportion of favorable changes.
ᵇSignificant at .01 level of confidence.
ᶜSignificant at .05 level of confidence.
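
The three-way mobility classification used in this part of the analysis (moving higher, moving lower, or staying the same between two years) might be sketched as follows. The pupil records are hypothetical, and the ordering of categories and the names `RANK`, `PERIODS`, and `mobility` are our own illustrative choices.

```python
from collections import Counter

# Ordering of the three sociometric categories, lowest to highest.
RANK = {"neglectee-isolate": 0, "regular": 1, "star": 2}

# The three year-to-year comparisons, as indexes into a pupil's record.
PERIODS = {"year 1-2": (0, 1), "year 2-3": (1, 2), "year 1-3": (0, 2)}


def mobility(earlier, later):
    """Classify a change in sociometric category between two years."""
    change = RANK[later] - RANK[earlier]
    if change > 0:
        return "higher"
    if change < 0:
        return "lower"
    return "same"


# Hypothetical records: category in years 1, 2, and 3.
records = {
    "P1": ("regular", "star", "star"),
    "P2": ("star", "regular", "regular"),
    "P3": ("neglectee-isolate", "neglectee-isolate", "regular"),
}

# Counts of higher / lower / same for each comparison period.
tallies = {
    period: Counter(mobility(rec[a], rec[b]) for rec in records.values())
    for period, (a, b) in PERIODS.items()
}
```

Tallies of this form, computed separately for each sub-group, supply the rows of a contingency table on which a chi-square comparison of mobility patterns can be made.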


Between-Treatment Differences 


Between-treatment comparisons in mobility first
were made between the total samples in the two districts.
The difference in sociometric status mobility
between Districts A and R from the first to the
second year was statistically significant beyond the
.001 level (chi-square of 14.94). Differences in
mobility patterns between years two and three and
years one and three were not statistically significant.
Examination of the significant contingency
table indicated that District A pupils were more mobile
during this period. This mobility was in an upward
direction for the most part and reflects the
marked readjustments in sociometric status that occur
when an ability grouping system is adopted.
Differences between the second and third year were
much smaller. This suggests that by the second
year, adjustments in sociometric status that were
brought about as a result of introducing the ability
grouping system had largely taken place and the new
status pattern that had emerged might be expected
to remain relatively stable.

In order to get additional insight into between- 
treatment differences in sociometric mobility, these 
data were analyzed further by comparing the socio- 
metric status of pupils of the same ability level un- 
der the two grouping systems using chi-square con- 
tingency tables. Because of a general absence of 
sex differences on this variable, boys and girls were 
combined for this analysis. In addition to comparing
superior, average, and slow pupils in the two
districts, pupils in each district who moved down- 
ward or upward in ability level were also compared 
to determine whether such movement in ability class- 
ification was accompanied by changes in sociometric 
status (see Table 13). 

Comparison of pupils of superior ability in the
two districts yielded no significant differences in
mobility of sociometric status. There was a slight
trend evident between years one and two for District
A superior pupils to make more status gains and
fewer status losses than were made by District R
superior pupils during this same period. This may
reflect a tendency for superior pupils to regain some
of the sociometric status they initially lose in an
ability grouped situation.

Comparisons between the sociometric status mobility
patterns of average pupils show District A pupils
to be significantly more mobile between years
one and two than comparable District R pupils. This
mobility was generally upward and reflects an improvement
of the average pupils’ status during the
second year of their experience in the ability grouped
situation. The difference between pupils of low
ability in the two districts in mobility of sociometric
status also reached statistical significance
in comparisons between years one and two. These
data seem to support the conclusion that the slow pupil
has a greater opportunity for gaining social approval
and recognition in an ability grouped classroom
than he does in a random grouped classroom.
The absence of any significant between-treatment
differences in mobility between the second and third
year of the study again suggests that adjustments in
sociometric status brought about by ability grouping
are largely completed by the second year after its
introduction.

We were also interested in learning what happened
to a pupil’s social status in the two grouping
systems when his ability level changed. In District
A such a change meant a change in classroom assignment,
while in District R the pupil would be unaware
of the change. First let us consider pupils
who received a lower ability level classification in
a succeeding year. District A pupils in this classification
experienced significantly more upward mobility
and less downward mobility than was experienced
by the comparable District R pupils. This
same trend prevailed in comparisons between the
second and third and first and third years, although
neither attained statistical significance. The movement
downward from one ability level classification
to another in the ability grouped situation would usually
result in a pupil being at the upper extreme
of his new ability classification. Thus, the downward
movement seemed to increase his relative social
status, as he was recognized as having come from
a higher ability group and was near the top of his
new group, whereas he had been near the bottom of
his previous group. In randomly grouped classrooms,
it will be recalled, the pupil’s ability level
classification was unknown to him. He was not
subject to room reassignment as a result of obtaining
a lower ability classification. These data would
suggest that any damage to the ability grouped pupil’s
self-concept that might occur because of his
reclassification to a lower ability section might be
compensated to some extent by gains in social status
he is likely to experience in his new class.

Finally, pupils in District A and District R who
moved upward in ability level classification were
studied. Two of the three chi-square comparisons
made between these pupils yielded statistically significant
differences. In each instance, pupils in random
grouped classrooms showed more favorable
sociometric status changes than comparable pupils
in District A classrooms. This difference may well
have been caused by the same factor that appeared
to operate on ability grouped pupils who moved downward
in ability level. That is, when the pupil is
moved from the upper extreme of one ability group
and placed in the lower extreme of the next higher
group his relative status within the classroom may
suffer. These findings suggest that although ability


TABLE 14

CORRELATIONS BETWEEN SOCIOMETRIC STATUS AND OTHER VARIABLES FOR EACH DISTRICT

                                Total Sociometric Status
Other Variables                 District R    District A

Mental Ability
  Language Mental Age             .396a         .052
  Non-Language Mental Age         .239a         .002
Academic Achievement              .161a         .085

Desirable Attitudes
  Toward School                   .234a         .121b
  Toward Teachers                 .099b         .113
  Toward Peers                    .355a         .311a
Concept of Self                   .161a         .194a
Acceptance of Self                .084          .141b
Ideal Self                        .226a         .129b
Discrepancy                       .013          .079

a Significant at .01 level of confidence.
b Significant at .05 level of confidence.


grouping might initially provide greater opportunities
for social leadership for those of average and
low ability, it might also make more difficult the
task of maintaining or gaining social recognition if the
pupil makes achievement gains sufficient to raise
his ability level classification. Loss of social status
that is likely to accompany his reassignment to
a new classroom might outweigh any gain in status
he may experience from being identified as a member
of the brighter group.


Relationships Between Sociometric Status and Other 
Variables 


Correlations between sociometric status and
mental ability measured by the California Test of
Mental Maturity showed significant relationships
for District R pupils on both the language and the
non-language scores, while relationships between
these variables in District A were close to zero
(see Table 14). Thus, mental ability is related to
sociometric status in random grouped classes but
not in ability grouped classes. These results suggest
that the relationship between mental ability and
sociometric status reported in previous research
may be dependent upon the type of classroom organization.
A number of researchers have reported
that children with higher levels of intellectual ability
tend to receive more sociometric choices than
children of lower ability (39, 52, 69, 13, 14, 15). It
is, of course, difficult to interpret relationships between
mental ability and sociometric choice in ability
grouped classrooms because pupils in such classrooms
are limited in their choice to other pupils in
a restricted ability range.

Correlations were computed between sociometric
status and academic achievement, using a composite
score based on four of the Sequential Tests of Educational
Progress. The results followed the same
pattern as the correlations with mental ability, with
a significant relationship found in the District R
sample and no significant relationship for the District
A sample. Correlations between sociometric
status and both intelligence and achievement are reduced
by the restriction of ability range in the District
A classrooms. These data suggest, however,
that in ability grouped classrooms the pupil’s ability
or achievement level is not a factor in determining
his sociometric status, while these variables appear
to exert some influence on status in random grouped
classrooms.
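
The effect of range restriction on a correlation can be shown with simulated scores. The sketch below is illustrative only: the data are generated, not drawn from the study, and the slope of 0.4, the noise level, the sample size, and the three equal-sized ability bands are all assumptions. The same underlying relationship between ability and status is built into every band, yet the correlation computed within each band is smaller than the correlation over the full range.

```python
import random


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


# Simulated pupils (hypothetical): status depends weakly on ability.
rng = random.Random(0)
N = 3000
ability = [rng.gauss(0.0, 1.0) for _ in range(N)]
status = [0.4 * a + rng.gauss(0.0, 0.9) for a in ability]

# Correlation over the full ability range, as in a random grouped class.
r_all = pearson_r(ability, status)

# Correlation within each third of the ability range, as in grouped classes.
order = sorted(range(N), key=lambda i: ability[i])
third = N // 3
bands = [order[:third], order[third:2 * third], order[2 * third:]]
r_bands = [
    pearson_r([ability[i] for i in band], [status[i] for i in band])
    for band in bands
]

print("full range:", round(r_all, 3))
print("within bands:", [round(r, 3) for r in r_bands])
```

Under these assumptions a low within-class correlation does not by itself show that ability has stopped mattering, which is the caution the restricted range calls for.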

Correlations between sociometric status and the 
attitudes of pupils toward school, their teacher, 
and peers were also computed. Low but significant 
relationships generally existed between desirable at- 
titudes and sociometric status with the highest cor- 
relations being found between sociometric status 
and attitudes toward peers. As the pupil’s attitude 
toward his peers is almost certainly influenced by 
his sociometric status, it is not surprising that a 
significant correlation was found between these two 
variables. 

Low but significant relations were also found between
the pupil’s sociometric status and the various
aspects of self-concept as measured by Bills’ Index
of Adjustment and Values. These correlations are
very low, ranging from .08 to .23 and reflecting
very slight relationships between the measures.
The presence of these relationships nevertheless
suggests that somewhat larger correlations
may exist between the characteristics themselves.
This speculation, however, must await the development
of more sophisticated measures before it may
be tested.


SUMMARY AND CONCLUSIONS 


The sociometric data collected for Sample IV pupils
during their fourth, fifth, and sixth grade
years, and for Sample VI pupils during their sixth-grade
year, seem to provide considerable evidence
in support of the following conclusions relating to
sociometric choice in ability grouped and random
grouped classes during the intermediate grades:

1. The overall proportions of stars, regulars,
and neglectees in Districts A and R were similar,
thus leading to the conclusion that ability grouping
does not result in a permanent leadership vacuum in
groups of average and slow pupils. Evidence suggests,
however, that some time is required for ability
grouped average and slow fourth grade pupils to
emerge as sociometric stars, thus creating a temporary
peer group leadership vacuum at this grade
level. No such temporary vacuum was evident for
Sample VI during the first year of the study, suggesting
that this temporary leadership vacuum, if
present at all, is of much shorter duration for sixth-grade
pupils.

2. Superior students lose some sociometric status
when placed in ability-grouped classrooms.
This loss is particularly evident in the star classification
but is not accompanied by any increase in
the neglectee-isolate classification. Data on mobility
of sociometric status suggest that superior pupils
regain some of this status loss after
they have made an adjustment to the ability grouped
situation.

3. Average and slow pupils appear to have a far 
better chance of gaining social recognition in ability 
grouped classrooms than do comparable pupils in 
random grouped classrooms. For the slow pupil, 
ability grouping not only appears to increase the pu- 
pil’s chances of being classified as a star, but also 
reduces his chances of being classified as a neglectee-isolate.
Sociometric status mobility patterns
in Districts A and R demonstrate a greater number
of status changes in ability grouped classrooms.
The trend of these changes is generally toward high- 
er sociometric status, further supporting the con- 
clusion that organization of pupils into ability 
grouped classes brings about marked changes in 
sociometric status patterns during the first one or 
two years and generally leads to permanently higher 
mean sociometric status for average and slow 
groups. 

4. In random grouped classrooms, the pupil’s 
ability level appears to be an important factor in de- 
termining his sociometric status. Sociometric sta- 
tus patterns differ considerably for pupils of differ- 
ent ability levels in random grouped classrooms, 
with pupils of higher ability consistently obtaining 
more favorable mean scores. The presence of sig- 
nificant correlations obtained when sociometric sta- 
tus was related to ability and achievement for Dis- 
trict R pupils further demonstrates the importance 
of ability to status in the random grouped class. 

5. Ability does not appear to be an important factor
in determining sociometric status in ability
grouped classrooms. In the Utah study sociometric
status patterns for pupils of different ability levels
in ability grouped classrooms were closely comparable,
showing no evidence of real difference. The
absence of correlations between ability and status
suggests that these factors are far less important
to friendship choices in ability grouped classrooms.


6. The lack of any consistent significant differences
between boys and girls suggests that the effects
of ability grouping and random grouping on
sociometric variables are about the same for both
sexes. Thus, it may be concluded that sociometric
status and mobility of sociometric status are essentially
unrelated to sex in either random grouped or
ability grouped classrooms at the intermediate
grade levels of the elementary school.


7. Low but significant correlations exist between
sociometric status, pupil attitude, and self-concept
variables. These results demonstrate the
presence of relationships between the pupil’s social
acceptance and his perception of himself, his school
environment, and his peers.




CHAPTER 8 


PUPIL ATTITUDES 10


THE POSSIBLE EFFECTS of ability grouping on 
the pupil’s attitudes toward school have been a major
concern of educators who have been interested in 
employing ability grouping in their schools. The 
atmosphere in which the pupil finds himself seems 
to differ considerably in the ability grouped and ran- 
dom grouped situation as well as being somewhat 
different at the different levels of the ability grouped 
situation. Thus, one might well expect to find dif- 
ferences in the manner that pupils react to and per- 
ceive the school environments that these treatments 
provide. 


WORKING HYPOTHESES 


Three related attitude areas are sampled by the 
USU School Inventory. These are Attitude Toward 
Peers, Attitude Toward Teacher, and Attitude To- 
ward School. Working hypotheses relative to these 
three scales and the other related measures used in 
the Utah study will now be discussed. 


Attitude Toward Peers 


1. It was hypothesized that no significant differences
in attitude toward peers would occur between
superior pupils in the two grouping treatments. Superior
District R pupils might be expected to have
more favorable attitudes toward peers because of the
greater opportunities these pupils have of gaining
social recognition in the random grouped classroom.
On the other hand, superior pupils in District A
probably have somewhat more in common
with their peers, such as similarity of achievement
values, better identification with each other, and
greater mutual respect, that could lead to more favorable
attitudes. It was considered likely that these
two influences would offset each other, thus leading
to the working hypothesis.

2. It was hypothesized that District A pupils of
average and low ability would have more favorable
attitudes than comparable District R pupils. Both
of the aforementioned influences appear to operate
in the same direction for average and slow pupils.
That is, these pupils would have greater opportunities
for peer recognition in the ability grouped classroom
and also would experience more similarity between
their interests and values and those of their
peers of comparable ability. Greater differences
would be expected to occur between the slow groups,
because of the relatively greater improvement of the
ability grouped pupil’s peer group status at this ability
level.

3. It was hypothesized that no within-treatment 
differences would occur on this score between Dis- 
trict A pupils at different ability levels. The fac- 
tors hypothesized to influence between-treatment 
differences in attitudes toward peers would tend to 
reduce differences between the attitudes of District 
A pupils at different ability levels. 

4. Previous research has indicated that in random
grouped classrooms peer status is related to
ability, and it was therefore hypothesized that significant
differences favoring the groups of higher ability
level would be found in the within-treatment comparisons
of District R groups.

5. No significant sex differences in attitudes to-
ward peers were hypothesized. 


Attitude Toward the Teacher and the School 


Most of the same differences between treatments 
that seem likely to influence the pupil’s attitudes to- 
ward teachers appear to operate in the same fashion 
on his attitudes toward school. Thus, the same dif- 
ferences were hypothesized for these two subscales. 

6. It was hypothesized that both superior and
slow pupils would have significantly more favorable
attitudes toward their teachers and school in ability
grouped classrooms than in random grouped classrooms.
It would be expected that attitudes toward
the teacher would be more favorable in situations
where the teacher can give more attention to individual
pupil problems and in situations where the
curriculum makes it possible for the teacher to lead
the pupil to a greater number of satisfying or need-fulfilling
experiences and a smaller number of frustrating
experiences. Ability grouping has been advocated
because of the apparent opportunities for the
teacher to provide such situations in more homogeneous
classes. Because both grouping situations
provide considerable need-fulfillment for the high
achieving pupil, it was expected that the largest differences
would be found between slow pupils in the
two districts.

7. No differences between average pupils in the 
two grouping treatments were hypothesized because 
the curriculum in random grouped classrooms is usually
well adapted for average pupils, thus suggesting
that little real difference between the treatments
exists at this ability level. 

8. Since the ability grouped classroom would appear
to adapt to the needs of pupils at all three levels
with about equal effectiveness, no significant
within-treatment differences in attitude toward the
teacher were hypothesized for District A.

9. It was hypothesized that superior and average 
pupils in District R would exhibit similar attitudes 
toward the teacher while the attitudes of slow pupils 
would be significantly less favorable. In District R, 
the average pupil would seem to have the fewest 
frustrations because the rate and level of teaching
are well adapted to his abilities. The high-achieving
pupil, however, also experiences considerable need-fulfillment
in the random grouped classroom even
though the curriculum may be geared at a rate below
his capacity. He receives considerable recognition 
and also can gain satisfaction from working on en- 
richment materials available. 

10. There seems to be some evidence that girls 
tend to have generally more favorable attitudes to- 
ward the total school situation than boys. Thus, it 
was hypothesized that for both districts significant
differences favoring girls would occur in the
between-sex comparisons on this scale.


PROCEDURE 


The USU School Inventory was administered to
Sample IV pupils during their sixth grade year; 311
pupils were tested in District A and 617 in District
R. As a measure of school attitudes meeting the requirements
of this study could not be found, it was
decided to develop such a scale. A large number of
statements expressing various ideas and feelings
about school-related topics were collected from both
teachers and pupils. These statements were reviewed
and rewritten in a question form that permitted
answering “yes”, “no”, or “?”. After pretest
and item analysis, 95 items were retained.

The reliability coefficient for the 95 items on the
total scale was .94, using the split-half correlation
corrected for length with the Spearman-Brown
Prophecy Formula. Reliabilities of the subscales
were .95, .91, and .82.
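
The length correction named here follows the standard Spearman-Brown relation, in which a test lengthened by a factor k has reliability kr / (1 + (k - 1)r), where r is the reliability of the original length. The sketch below is a minimal illustration; the function names are hypothetical, and the half-test correlation it prints is back-calculated from the reported .94 rather than taken from the study.

```python
def spearman_brown(r, k=2.0):
    """Predicted reliability when a test with reliability r is lengthened k times."""
    return k * r / (1.0 + (k - 1.0) * r)


def half_test_r_from_full(r_full):
    """Invert the k = 2 case: the half-test correlation implied by r_full."""
    return r_full / (2.0 - r_full)


# Back-calculation from the reported total-scale reliability of .94.
# The uncorrected split-half correlation is not reported; this is only
# the value the formula implies.
r_half = half_test_r_from_full(0.94)
print(round(r_half, 3))                  # roughly 0.887
print(round(spearman_brown(r_half), 2))  # recovers 0.94
```

With k = 2 the formula reduces to 2r / (1 + r), the form used to correct a split-half correlation to full test length.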

Product moment correlation coefficients were also
calculated between scores on the three subscales
to determine their level of independence. The correlation
coefficient between school attitude and peer
attitude was .54; between attitude toward teacher
and toward peers was .46; and between attitude toward
the teacher and toward school was .57. In
view of the fairly high reliabilities of the subscales,
these intercorrelations suggest that the three subscales
of the USU School Inventory, though related,
are measuring somewhat different facets of the pupil’s
attitude.


RESULTS 


The results of the attitude study were analyzed
using a single classification analysis of variance to
compare attitude scores of pupils in the two grouping
treatments, at different ability levels and of different
sex.
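
A single classification (one-way) analysis of variance compares the variation between group means with the variation of scores within groups. The sketch below shows the computation of the F ratio on small made-up score lists; the scores and group labels are hypothetical, and the number of groups entering each of the study's comparisons is not assumed here.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of score lists."""
    all_scores = [score for group in groups for score in group]
    grand_mean = sum(all_scores) / len(all_scores)
    means = [sum(group) / len(group) for group in groups]

    # Between-groups sum of squares: group means around the grand mean.
    ss_between = sum(
        len(group) * (mean - grand_mean) ** 2
        for group, mean in zip(groups, means)
    )
    # Within-groups sum of squares: scores around their own group mean.
    ss_within = sum(
        (score - mean) ** 2
        for group, mean in zip(groups, means)
        for score in group
    )

    df_between = len(groups) - 1
    df_within = len(all_scores) - len(groups)
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within


# Hypothetical attitude scores for three groups (not the study's data).
slow = [1, 2, 3]
average = [2, 3, 4]
superior = [5, 6, 7]
f_ratio, df_b, df_w = one_way_anova([slow, average, superior])
print(f"F({df_b}, {df_w}) = {f_ratio:.2f}")
```

The resulting F ratio would then be referred to a table of F with the two degrees of freedom shown to judge significance at the .05 or .01 level.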


Attitude Toward Peers 


Comparisons between pupils of similar ability in 
District A and R on the Attitude Toward Peers scale 
revealed no differences approaching significance nor 
did the slight differences found occur consistently in 
favor of one district or the other. Thus, the two 
grouping treatments had no significant differential 
effect upon this attitude as measured by the USU 
School Inventory. Within-treatment comparisons on 
the Attitude Toward Peers scale generally supported 
the working hypotheses. Four of the six differences 
between pupils of different ability levels in District 
R were statistically significant with the higher abil- 
ity group having more favorable attitudes in all 
cases. Within-treatment comparisons for District 
A pupils revealed much smaller differences with on- 
ly one of the six differences reaching statistical sig- 
nificance at the .05 level. These data suggest that 
attitudes of pupils toward their peers are rather con- 
sistently related to ability in random grouped 
classes. Sex differences were slight on this varia- 
ble, and no significant sex differences were found. 


Attitude Toward the Teacher 


The pupil’s attitude toward his teacher was con- 
sidered the most important of the three areas that 
the USU School Inventory was designed to measure. 
Of the 95 items on the inventory, 50 were aimed at 
measuring this aspect of pupil attitude. Both supe- 
rior groups in District A scored significantly higher 
on this subscale than comparable District R pupils. 
Average pupils in the two districts received very
similar mean scores with differences falling far
short of statistical significance. Slow boys in District
A received significantly more favorable scores
than comparable District R pupils, but slow girls
were not significantly different in the two districts.
Thus, the working hypotheses were well supported 
for superior and average pupils and partially support- 
ed for slow pupils. 

Comparisons between pupils of different ability
levels on the Attitude Toward the Teacher score revealed
three statistically significant differences in
the six comparisons made for District R pupils.
These differences all involved superior groups and
indicated a tendency for superior pupils in District
R to have somewhat more favorable attitudes than
either average or slow pupils. In District A no significant
within-treatment differences were hypothesized.
The results, however, indicated four significant
differences on the six comparisons. Three of
the four comparisons involving superior students
were significant, in all cases favoring the superior
group. The fourth significant difference was between
average and slow boys; it favored the slow
group. Slow girls in District A also showed more
favorable attitudes toward the teacher than did average
girls, although this difference did not reach statistical
significance. No significant sex differences
emerged on this variable, although five of the non-significant
differences favored girls as hypothesized.


Attitude Toward School 


It was hypothesized that Attitude Toward School
scores would favor District A pupils in comparisons
at the superior and slow ability levels and that no
differences would occur between average pupils in
the two treatments. Only one significant difference
emerged. This difference was between slow boys in
the two districts and favored District A pupils at the
.01 level.

No within-treatment differences were expected 
on Attitude Toward School scores for District A. 
Three significant differences occurred, all favoring
superior pupils as compared with pupils of average 
or low ability. These differences probably are a 
reflection of the need-fulfillment provided in both 
districts for high achieving pupils. Within-treat- 
ment comparisons in District R showed a similar 
pattern with four significant differences emerging, 
all of which favored the group of higher ability. 

It was hypothesized that girls would have signifi- 
cantly more favorable attitudes toward school in 
both districts than boys of comparable ability. 
Three of the six comparisons between the sexes
yielded statistically significant differences, all in 
the hypothesized direction. These data provide fair- 
ly clear evidence that girls may be expected to show 
more favorable attitudes toward school than boys, 
especially at the superior level. 


SUMMARY AND CONCLUSIONS 


Pupils in Sample IV were administered the USU 
School Inventory during their sixth grade year. The 
USU School Inventory contains three scales: Atti- 
tude Toward Peers, Attitude Toward the Teacher, 
and Attitude Toward School. 

Comparisons between the two grouping treatments,
between pupils of different ability levels
within each grouping treatment, and between comparable
pupils of the opposite sex were made using
single classification analysis of variance. Significant
differences between mean scores of various
subgroups on the aforementioned measures form the
basis for our conclusions.

Results on the Attitude Toward Peers scale re- 
vealed no significant between-treatment differences. 
Four of the six within-treatment differences in Dis- 
trict R were significant, favoring the higher ability 
group in each case. No significant sex differences 
were found on this scale. These results suggest 
the following conclusions: 

1. The two grouping treatments had no significant
effect on attitudes of pupils toward their peers.

2. Attitudes toward peers were found to be con- 
sistently related to ability in the random grouped 
classrooms while no such relationship was found in 
the ability grouped classrooms, suggesting that abil- 
ity is more important in determining peer attitudes 
in the random grouped situation. 

3. There was no relationship between sex and at- 
titudes toward peers. 


Comparisons on the Attitude Toward the Teacher
scale showed that superior girls and boys and slow
boys in District A received significantly more favorable
scores than comparable District R pupils.
No significant differences were found between aver- 
age pupils in the two districts. Within-treatment 
comparisons between pupils at different ability lev- 
els showed three significant differences in District 
R, all favoring the superior groups. There were 
four significant differences between District A pu- 
pils of different ability levels, three favoring supe- 
rior groups and one favoring slow boys over average 
boys. No significant sex differences were found in 
this attitude area. From these data it may be con- 
cluded that: 

4. Ability grouping appears to cause more fav- 
orable attitudes toward the teacher than random 
grouping for both superior and slow pupils. 

5. Superior pupils in both districts indicated
more favorable attitudes toward the teacher than pupils
at the lower ability levels.

6. There was some tendency in District A for 
slow pupils to have more favorable attitudes toward 
the teacher than average pupils, thus supporting the 
conclusion that ability grouping has a favorable ef- 
fect upon the attitudes of slow pupils toward the 
teacher. 

7. There were no differences between the mean 
attitudes of comparable boys’ and girls’ groups to- 
ward the teacher. 


Comparisons between the two districts on the Attitude
Toward School scale showed one significant difference
favoring slow boys in District A. In both
districts there were significant differences between 
pupils of different ability levels, all indicative of 
more favorable attitudes for pupils of higher ability. 
Three of the six sex differences on the Attitude To- 
ward School scale were statistically significant. 
These differences, all of which favored girls, were 
largest at the superior level. It may be concluded 
that: 

8. Boys of low ability developed more favorable 
attitudes toward school in ability grouped class- 
rooms than in random grouped classrooms. 

9. Superior pupils had more favorable attitudes
toward school in both treatments than pupils of low- 
er ability. 

10. Superior girls had more favorable attitudes
toward school in both districts than boys of comparable
ability.


CHAPTER 9


PUPIL PROBLEMS 


THE MAIN purpose of this chapter is to compare 
the problems reported by pupils in ability grouped 
and random grouped classes. The instruments employed
were the SRA Youth Inventory and the SRA
Junior Inventory. These measures are essentially 
check lists each of which classifies the problems 
listed into a number of areas. A study of these 
problem areas on the two inventories revealed sev- 
eral areas in which one of the grouping treatments 
employed in the Utah study might be expected to 
lead to greater numbers of pupil problems than the 
other grouping treatment. 


WORKING HYPOTHESES 


As no previous research has been reported in 
this area, working hypotheses were established by 
weighing the possible psychological effects of abil- 
ity grouping and random grouping on the various 
problem areas covered on the SRA Junior Inventory. 

The “Looking Ahead” area of the SRA Youth
Inventory is concerned primarily with problems related
to choosing a career, going to college, and
finding a job (94). Many of the problems included
in this section are concerned with the pupil’s insight
into his ability.

1. It was thus hypothesized that District A pupils
would indicate fewer problems in this area than
District R pupils at all ability levels, with greatest
differences occurring between superior and average
subgroups in the two districts. It appears likely
that the ability grouping situation gives pupils at all
levels a sounder basis for estimating their ability.
Superior and average pupils should be better able
than slow pupils to translate this information into
answers to the sort of questions raised in this section
of the Youth Inventory.

The next problem area to be discussed, “Getting
Along with Others”, is found in both of the SRA in-
ventories. In the Junior Inventory, the majority of 
the items relate to social adjustment problems, 
while in the Youth Inventory, social skills problems 
are emphasized (95). 

2. It was hypothesized, however, that District
R pupils would have a greater number of problems
on both scales than the comparable District A pupils.
Because of the reduced chances of social rejection
stemming from ability differences, it would
seem that social adjustment problems would be less
serious in District A, especially for pupils of low
ability. Social skills appear less likely to be influenced
by the grouping treatment, but again it appeared
probable that the greater opportunities
in District A for pupils at the average and slow levels
to gain leadership positions and play more active
roles in social activities would tend to result in a
lower frequency of social problems.

The “Boy Meets Girl” area of the SRA Youth Inventory
is concerned with dating, sex information,
and sex mores. It seemed unlikely that the
grouping treatments would have any effect on the
latter two topics. With regard to dating practices, 
however, it appeared likely that District A pupils 
would report fewer problems. Ability grouping at 
all levels would appear to place pupils in contact 
with more individuals having similar interests and 
aspirations. Also, the more favorable social cli- 
mate present in the average and slow ability grouped 
sections should tend to reduce dating problems. 

3. Thus, it was hypothesized that between-treat- 
ment differences in the ‘‘Boy Meets Girl’’ problem 
area would generally favor pupils in ability grouped 
classes. 

The ‘‘Basic Difficulty Key’’ for the SRA Youth In- 
ventory is made up of those items considered by 
mental hygiene experts to be indicative of possible 
personality disturbance (94). Although our working
hypotheses on the SRA problem areas generally predicted
more problems for pupils in random
grouped classes, large differences were not expected,
and there seemed to be little reason to expect
either grouping system to bring about serious per- 
sonality disturbances. 

4. Thus, it was hypothesized that no significant 
between-treatment differences would occur on the
‘‘Basic Difficulty Key’’.

Three of the problem areas found in the SRA
Junior and Youth Inventories were administered
only because removing them from the instrument
would have exposed subjects to a non-standard administration
of the measure that could in turn have
changed responses in some of the areas where di- 
rectional hypotheses were established. No working 
hypotheses were developed for these areas. Other 
problem areas such as the ‘‘My School’’ scale, and 
the ‘‘About Myself’’ scale were measured more ad- 
equately by other instruments in the Utah study. 11 


PROCEDURE 


Sample 
The SRA Junior Inventory, Form S was administered
to Sample IV pupils in Districts A and R during
their fifth-grade year. The SRA Youth Inventory,
Form A was administered to Sample VI pupils during
their seventh grade year, Sample VII pupils during 
their eighth grade year, and Sample VIII pupils dur- 
ing their ninth grade year. It will be noted that for 
all samples, these data were collected during the 
second year of the study, after District A pupils had 
been exposed to about one and one-half years in the 
ability grouping situation. 


Measures 


The problem areas in the SRA Junior Inventory 
include the following: ‘‘About Me and My Home’’, 
‘‘About Myself’’, ‘‘Getting Along With Other People’’,
and ‘‘Things in General’’. The SRA Junior Inventory,
Form S provides a means for the pupil to indicate
the seriousness of each problem. In addition
to counting the number of problems checked in 
each of the aforementioned areas, a count of each 
pupil’s ‘‘big problems’’ was made. 
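
The scoring just described can be sketched in a few lines. This is a minimal illustration, not the publisher's scoring procedure: the response codes (0 = not checked, 1 = checked, 2 = checked as a big problem), the item-to-area mapping, the rule that a big problem also counts as a checked problem, and the function name `score_junior_inventory` are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical response codes (assumed for illustration only).
NOT_CHECKED, CHECKED, BIG_PROBLEM = 0, 1, 2


def score_junior_inventory(responses, item_area):
    """Return (problems checked per area, total number of big problems).

    responses: dict mapping item id -> response code
    item_area: dict mapping item id -> problem-area name
    """
    per_area = Counter()
    big_problems = 0
    for item, code in responses.items():
        if code == NOT_CHECKED:
            continue
        # Assumption: any checked item, big or not, counts toward its area.
        per_area[item_area[item]] += 1
        if code == BIG_PROBLEM:
            big_problems += 1
    return dict(per_area), big_problems


# Made-up example: four items spread over two areas (not study data).
item_area = {1: "About Myself", 2: "About Myself",
             3: "Things in General", 4: "Things in General"}
responses = {1: CHECKED, 2: BIG_PROBLEM, 3: NOT_CHECKED, 4: CHECKED}
area_counts, big_count = score_junior_inventory(responses, item_area)
```

For the Youth Inventory, Form A, which offers no seriousness rating, the same sketch would apply with only the first two response codes.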

The SRA Youth Inventory, Form A was adminis- 
tered to the junior-high-school samples. The Youth 
Inventory is similar to the Junior Inventory except
for differences necessary to better fit the problems
of junior and senior high school pupils. On
the Junior Inventory, Form S the pupil may indicate
the seriousness of his problems, while this cannot 
be indicated on the Youth Inventory, Form A. 

The Youth Inventory contains items in eight problem
areas. These areas are entitled: ‘‘My School’’,
‘‘Looking Ahead’’, ‘‘About Myself’’, ‘‘Getting Along
With Others’’, ‘‘My Home and Family’’, ‘‘Boy Meets
Girl’’, ‘‘Health’’, and ‘‘Things in General’’. In addition
to these eight problem areas, which contain a
total of 298 items, a ‘‘Basic Difficulty Key’’ has
been developed which provides another score on the
Youth Inventory.


RESULTS


The first score to be discussed, ‘‘Looking
Ahead’’, is not included in the Junior Inventory, as
its questions are concerned primarily with problems
related to choosing a career, going to college and
finding a job. Typical items in this section include:
‘‘How much ability do I actually have?’’, ‘‘Shall I go 
to college?’’, ‘‘Do I have the ability to do college 
work?’’, ‘‘What jobs are open to high school grad- 
uates?’’ (95). 

Of the 18 between-treatment comparisons of
group means in this problem area, there were nine
differences that reached or approached statistical
significance, all favoring the District A subgroups
(see Table 15). As a number of the questions in this
area are concerned with ability, it is not surprising
to note that the pupils in ability grouped classrooms
reported fewer problems in this area than pupils in
random grouped classrooms. This difference was
found most commonly among superior boys and girls
and average boys and may reflect a tendency for the 
District R pupils to have somewhat more doubts as 
to the level of their ability, thus leading to a larger 
number of problems. 

The ‘‘Getting Along With Others’’ section of the 
Youth Inventory is concerned with problems related 
to the pupil’s social development with considerable 
emphasis upon social skills. Typical of the items in
this area are: ‘‘I wish I could carry on a pleasant
conversation’’, ‘‘I want to learn to dance’’, and ‘‘I
need to learn how to plan a party’’. Social skill items
make up about two-thirds of the 40 items in this area.
Most of the remaining items in this section deal
more directly with social adjustment, for example:
‘‘I wish I were more popular’’, and ‘‘I am not attractive
to the other sex’’ (94).


The SRA Junior Inventory contains 34 items in
this area. In contrast to the Youth Inventory, the
great majority of Junior Inventory items relate to
social adjustment problems both with adults and
peers, with very few pertaining directly to social
skills. In view of these differences in the two measures,
Sample IV data are not directly comparable to
data from Samples VI, VII, and VIII. 

Between-treatment comparisons for Sample IV
showed District A average pupils, both boys and 
girls, to have significantly higher mean problem 
scores in this area than comparable District R pu- 
pils. Although not significant, this same trend was 
found in the mean scores of slow boys and girls. 

Results for Samples VI, VII, and VIII, who were
administered the Youth Inventory, showed a trend for
District R pupils to have more problems in this area
at all three ability levels. Of the 18 comparisons
made at these grade levels, three were statistically
significant and four approached significance, all in
the aforementioned direction. It will be noted in Table
15 that this trend is no more pronounced at one
ability level than another. 

These data would indicate that at the elementary 
level, District A pupils listed more social adjust- 
ment problems than their District R counterparts, 
while at the secondary level, District R pupils listed 
more problems on the Youth Inventory which, it will
be recalled, deals more with social skills than social
adjustment. The latter trend is consistent enough
to indicate a true difference that is general at all 
three ability levels. 

The ‘‘Boy Meets Girl’’ area is not included in the
SRA Junior Inventory. Most of the 32 items included
in this area of the Youth Inventory are concerned
with either dating or sex information and mores, with
about equal coverage of these two topics. Typical
items on dating include: ‘‘I don’t know how to ask
for a date’’, and ‘‘I don’t know what to do on a date’’.
Typical items on sex include: ‘‘I am embarrassed
by any discussion on sex’’, and ‘‘I need more information
about sex’’ (94).

TABLE 15

BETWEEN-TREATMENT DIFFERENCES ON SRA JUNIOR AND YOUTH INVENTORIES

[Table entries not legible in this copy. Column groups: Looking Ahead (Samples VI, VII, VIII); Getting Along With Others (Samples IV, VI, VII, VIII); Boy Meets Girl (Samples VI, VII, VIII); Basic Difficulty Key (Samples IV a, VI, VII, VIII). Rows: Superior Boys, Superior Girls, Average Boys, Average Girls, Slow Boys, Slow Girls, each followed by a line of t-ratios.]

* Difference between indicated groups approaching significance, favoring district indicated (A or R).
a Serious problems on Junior Inventory.
b Difference between indicated groups significant, favoring district indicated and giving ‘‘t’’ value.


Review of between-treatment differences showed
a consistent and marked trend for District R pupils
to report more problems in this area than comparable
pupils in District A (see Table 15). Eight of the
18 between-treatment comparisons were statistic- 
ally significant and one additional approached sig- 
nificance; all in the aforementioned direction. Dif- 
ferences were generally larger and occurred most 
often between superior groups and least often be- 
tween slow groups. Four of the six comparisons be- 
tween superior pupils were statistically significant 
as compared with only one significant difference be- 
tween slow pupils. Although between-treatment dif- 
ferences were hypothesized on this variable, the 
number of significant differences that emerged was 
somewhat surprising. Further research will be 
needed to determine whether the differences found 
in the Utah study are generalizable to other samples
and, if so, to explore possible reasons why the ability
grouped situation tends to be associated with fewer
problems being reported on this section of the
Youth Inventory.


The Junior Inventory, Form S, administered to Sample
IV provided a means for pupils to identify the
level of seriousness of their problems. The number
of serious problems was obtained separately for pupils
taking the Junior Inventory. Two significant between-treatment
differences were found for Sample
IV on this serious problem score: District A average
boys and girls listed a greater mean number of
serious problems than comparable pupils in District R.

The ‘‘Basic Difficulty Key’’ of the SRA Youth Inventory
includes 101 items taken from the Inventory
that a panel of seven mental hygiene experts considered
to be indicative of a possible basic personality
disturbance. For the most part, the remaining
items on the Youth Inventory are more a matter
of problem recognition. A number of comparisons
of ‘‘Basic Difficulty’’ scores with external criteria 
reported in the test manual would indicate that the 
scale had some validity as a measure of overall ad- 
justment (94). 

Between-treatment results showed that, although
only one difference reached statistical significance, 
six other differences approached significance. All 
but one of these differences were found in compari- 
sons of superior and average students, and all but 
one difference indicated a higher basic difficulty 
score for pupils in District R. It is difficult to be 
sure on the basis of field study data that these differences
came about as a result of the two grouping
treatments. These data, however, may lead us to 
question the claim often made by educators that abil- 
ity grouping tends to cause pupil maladjustment. 


SUMMARY AND CONCLUSIONS


During the second year of the Utah study, Sample
IV pupils were administered the SRA Junior Inventory,
Form S, and Sample VI, VII, and VIII pupils
were administered the SRA Youth Inventory, Form
A. The results of these measures were analyzed,
using analysis of variance in order to compare the 
mean number of problems checked by pupils of com- 
parable ability in the two districts. As the SRA 
Youth Inventory was administered to three different 
samples at the junior-high-school level, an overall 
summary of the results on this measure seems de- 
sirable. 
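
The between-treatment comparisons summarized here rest on differences between two group means. As a rough illustration only, and not the study's actual analysis (which the text describes as analysis of variance), the following sketch computes a pooled-variance t-ratio for two groups. The function name `pooled_t_ratio` and the problem counts are invented for the example.

```python
import math
from statistics import mean, variance


def pooled_t_ratio(district_a, district_r):
    """Pooled-variance t-ratio; positive when District R's mean is higher."""
    n_a, n_r = len(district_a), len(district_r)
    if n_a < 2 or n_r < 2:
        raise ValueError("each group needs at least two scores")
    # Weighted average of the two sample variances.
    pooled_var = (
        (n_a - 1) * variance(district_a) + (n_r - 1) * variance(district_r)
    ) / (n_a + n_r - 2)
    if pooled_var == 0:
        raise ValueError("t-ratio is undefined when neither group varies")
    std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_r))
    return (mean(district_r) - mean(district_a)) / std_err


# Invented problem counts for two small groups (not study data).
t_ratio = pooled_t_ratio([1, 2, 3], [2, 3, 4])
```

With only two groups, the F ratio from a one-way analysis of variance equals the square of this t-ratio, so the two approaches lead to the same decision.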

In reviewing the between-treatment differences 
on the problem areas of the Youth Inventory report- 
ed in this chapter, perhaps the most noteworthy 
trend is the preponderance of significant differences 
indicating larger mean problem scores for the District
R groups. A total of 32 between-treatment differences
reached or approached statistical significance
for the junior high school samples. These differences
were found in all samples. Of these differences,
only one indicated more favorable problem
scores for District R pupils while the other 31 indi- 
cated more favorable problem scores for District 
A. When the reader considers that these data are 
based upon pupils in three different junior high 
schools in each district, and when it is recalled that 
the samples were comparable on such variables as 
intelligence and sociometric status, the results 
would seem to suggest strongly that treatment differ- 
ences between ability grouping and random grouping 
in mathematics and science may have a widespread 
effect upon the number of problems perceived by jun- 
ior high school students. Whether this difference 
reflects a greater awareness of problems among 
District R pupils or reflects a tendency for ability 
grouping to reduce problems or help pupils deal with 
them cannot be answered from the data collected in 
this research. 

When the data in Table 15 are summarized by ability
level, another trend emerges: the tendency for
the frequency of significant differences between the 
districts to be related to ability level. Of the 32 dif- 
ferences reaching or approaching statistical signifi- 
cance on the SRA Youth Inventory, 15 of these differ- 
ences are found between superior pupils in the two 
districts, and in 14 of the 15 differences the District 
R pupils obtained higher (less favorable) scores. 
Eleven of the 32 differences were between average 
pupils in the two districts, with all differences indi- 
cating higher problem scores for District R groups. 
These results, reflecting as they do a consistently 
high number of problems for superior and average 
District R groups, would suggest the presence of 
true treatment differences at these ability levels. 
This trend, although present for slow pupils, is far 
less pronounced, and less general from sample to 
sample. Six differences were found between slow 
pupils in the two districts and all reflected higher
problem scores for District R pupils. All six of the
between-treatment differences found among slow pupils,
however, occurred in Sample VIII, suggesting
that they reflect unique characteristics of the sample
rather than a generalizable treatment effect.
These data support the overall conclusion that ability
grouped junior high school pupils in average or
superior groups report fewer problems than do comparable
pupils in random grouped classes.
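
The counts quoted in the two preceding paragraphs can be checked with simple arithmetic. The sketch below only restates figures given in the text; the variable names are illustrative.

```python
# Differences reaching or approaching significance on the Youth Inventory,
# by ability level, as (total, number with higher District R scores),
# i.e. differences favoring District A. Figures are taken from the text.
by_level = {
    "superior": (15, 14),
    "average": (11, 11),
    "slow": (6, 6),
}

total = sum(n for n, _ in by_level.values())
favoring_a = sum(a for _, a in by_level.values())
favoring_r = total - favoring_a
```

The totals agree with the summary statement that 31 of the 32 differences favored District A and one favored District R.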

Let us now summarize findings in the specific 
problem areas of the two SRA inventories and see 
what conclusions may be drawn. These conclusions 
are better supported for junior high school pupils 
than for elementary school pupils, because of the 
availability of comparative data from three junior
high school samples.

Ability grouped junior high school pupils were 
found to report fewer problems on the ‘‘Looking
Ahead’’ scale of the SRA Youth Inventory than com- 
parable pupils in random grouped classes. These 
differences are most pronounced between superior 
boys and girls and average boys. It may be con- 
cluded that junior high school pupils in ability 
grouped classes have fewer problems relating to 
choosing a career, going to college, and finding a 
job than do comparable pupils in random grouped
classes. 

The ‘‘Getting Along With Others’’ scale differs 
somewhat on the two inventories. The results 
showed two significant between-treatment differ- 
ences favoring District R pupils on the elementary 
school data. These results lead to the tentative con- 
clusion that average pupils in the ability grouped 
classes at the elementary level have more social 
adjustment problems than comparable random
grouped pupils. Three significant differences plus
four differences approaching significance were found
on the junior high school data, all indicating higher
problem scores for District R pupils. At this level,
it may be concluded that there is a tendency for random
grouped pupils at all ability levels to have a
greater number of problems relating to social skills 
than comparable pupils in ability grouped class- 
rooms. 

Comparisons between the treatments on the ‘‘Boy
Meets Girl’’ scale showed eight statistically significant
differences and one difference approaching
significance. These differences all indicated a
larger number of problems for District R junior
high school pupils. From these data, it may be concluded
that there is a consistent tendency for junior
high school pupils in random grouped classes to
report greater numbers of problems relating to dating,
sex information, and sex mores than comparable
pupils in ability grouped classes. These differences
were most marked between superior pupils
and least pronounced between slow pupils in the two 
treatments. 

Results on the ‘‘Basic Difficulty Key’’ of the 
Youth Inventory revealed one significant difference 
favoring District A, and five differences approach- 
ing significance in the same direction. These data 
suggest that if any significant between-treatment
difference in pupil adjustment was present, this 
difference favored the ability grouping system. Be- 
cause of the small differences found between the 
districts on the ‘‘Basic Difficulty Key’’ however, it 
seems safer to conclude that neither treatment is 
consistently associated with more indications of pupil
maladjustment.


SELF-CONCEPT


THE SELF-CONCEPT or phenomenal self as defined
in contemporary psychology refers to the individual’s
perception of himself. That is, expressed
in simple terms, the ideas, perceptions, and beliefs 
that form the image of himself that the individual has 
created. This picture that he has of himself may be 
highly realistic in the well adjusted individual, or it 
can be completely detached from reality as in the 
case of the psychotic who believes himself to be Napoleon.
Regardless of the individual’s self-concept,
however, he tends to behave in a manner that he
considers to be consistent with this concept. The 
self-concept is considered to be an important factor 
underlying the individual’s behavior by most contemporary
personality theorists. Wylie, in her review
of the self-concept, points out that ‘‘... all the
theories of personality which have been put forth
within the last two decades assign importance to a
phenomenal and/or nonphenomenal self-concept with 
cognitive and motivational attributes’’ (125). 

Among theorists who assign an important role to
the self-concept are Adler (1), Freud (38), Horney
(62), Lecky (71), Maslow (80), Rogers (98), and
Snygg and Combs (106). Some personality theorists 
including Rogers, Snygg and Combs, and Lecky, 
stress the self-concept as the principal determiner of 
behavior. Generally, these theorists consider that 
the need to preserve and enhance the self-concept is 
a basic human motivation. 

Lecky (71) stresses preservation of the self-concept
and relates this variable to school behavior. He
believes that the child who sees himself as weak, 
bad, or stupid will behave in ways that will tend to 
perpetuate this picture and refuse to accept the idea 
that he is good, strong, or clever. Jersild (65: 114) 
reflects a similar opinion concerning learning when 
he points out that ‘‘When a person resists learning 
that may be beneficial to him, he is, in effect, try- 
ing to protect or to shield an unhealthy condition. 
But, more broadly speaking, he is not actually pro- 
tecting something unhealthy as such; he is trying to 
safeguard his picture of himself, his self-concept, 
the illusions concerning himself which he has built 
and which give him much trouble.’’

When we view the self-concept as a determiner of
behavior and as a picture that the individual will 
maintain at considerable cost but is under some pres- 
sure to change when his behavior is contradictory to 
it, we can see some definite implications of this the- 
ory to the school situation and more specifically to 
the possible effects of ability grouping. Part of the
picture that makes up the pupil’s self-concept is concerned
with the school. This picture, of course, is
highly complex and involves such matters as the individual’s
estimate of his achievement, his perception
of his status relative to the other pupils in the
class, his satisfaction with his school role, and his
attitudes about study. Each aspect of the pupil’s
self-perception has subtle shadings and is colored by
his value system, the pressures placed upon him by
the home, and a great many other variables.


The relative impact of the two grouping treat- 
ments upon the individual’s self-concept appears to 
be an important factor in their overall evaluation.
Ability grouping and random grouping might be ex- 
pected to influence the individual’s self-concept in a 
number of ways. For example, a pupil who has been
unaware of possessing superior ability during the
years he has spent in a random grouped system may,
upon being assigned to a superior class in an ability
grouped system, become more actively aware of his
academic status and incorporate this into his self-concept.
The pupil of low ability may similarly be
successful in a heterogeneous class in maintaining a 
higher self-perception of his ability than the objec- 
tive reality of the situation would justify. Upon be- 
ing placed in an ability grouped system, however, in 
which this unrealistic self-concept is directly con- 
trary to his placement in a low ability group, he may 
find it no longer possible to maintain his previous 
estimate of his own ability and may thus find it nec- 
essary to revise downward some aspects of his self- 
concept. 


On the other hand, the increased competition in 
the ability grouped class may tend to lower the self- 
concept of the bright pupil while the decreased com- 
petition faced by the slow pupil may raise his self- 
concept. Another circumstance that could lead to re- 
vision of the individual’s self-concept in an ability 
grouped situation arises when his initial assignment 
to a section at one ability level is followed by subse- 
quent reassignment to a lower or higher section. 


Thus, it appears that, in comparisons between 
pupils of the same ability level in the two districts, 
two major treatment differences would operate that 
could have an effect upon the self-concept scores of 
the subjects. These differences are: 1) the awareness
of the pupil in District A of the level at which he
is placed, and 2) the necessity in District R for the
pupil to compete with pupils of all ability levels.


WORKING HYPOTHESES


Our estimate of the impact of these two factors 
upon the measures of self-concept employed in this 
research forms the basis for developing our work- 
ing hypotheses. 


The Index of Adjustment and Values, used in this
research, yields four scores. These scores are:
1) Concept of Self, 2) Acceptance of Self, 3) Ideal
Self, and 4) Discrepancy Score, the latter obtained
by subtracting Concept of Self from Ideal Self.

Applying the aforementioned treatment differences
to superior District A pupils, we can assume
that they would, in all probability, be aware of their 
placement in superior classes and this placement 
should tend to develop higher self-concepts in the 
individual. On the other hand, placement in the su- 
perior section brings with it more intense competi- 
tion and less of a feeling of being outstanding that 
the pupil may have experienced prior to the adoption
of ability grouping in District A. Because most su- 
perior pupils could be expected to be reasonably 
aware of their ability in either treatment, the latter 
factor may cause a negative net effect on self-con- 
cept. 

1. Thus, it was hypothesized that District A su- 
perior pupils would develop higher Discrepancy 
Scores and lower Acceptance of Self scores. In oth- 
er words, the bright pupil in the ability grouped 
classroom would be aware of his superiority but 
would also be aware that the other pupils in his 
group were equally or perhaps more superior and 
this awareness would lead him to be less self-satisfied
than he would be in a random grouped class.

2. It was hypothesized that average pupils in 
District A would obtain lower scores in Self-Concept
and Acceptance of Self and higher Discrepancy
Scores because the individual would be more actively
aware of his average status and less likely to
overestimate his ability than would be the case in 
District R. Thus, District A average pupils would 
be somewhat obliged to face reality, but would be in 
little better competitive position than if they were in 
a heterogeneous classroom. 

It was considered that the same factors hypothe- 
sized to influence the self-concept of superior pupils 
would act somewhat differently upon the slow pupil. 
Identification with the slow group would tend to lower
the pupil’s self-concept, while membership in the homogeneous
class would make him better able to compete,
and this better competitive position might tend
to enhance his self-concept. It was further reasoned,
however, that the stigma of being identified
with the slow group would outweigh the advantages
gained by the individual’s better competitive position. 

3. Thus, it was hypothesized that slow pupils in 
District A would have lower Self-Concept scores, 
lower Acceptance of Self scores and higher Dis- 
crepancy Scores than would slow pupils in District R. 

It will be noted that no working hypotheses were
developed for the Ideal Self score, as this variable
was not expected to change as a result of the group- 
ing situation. 


PROCEDURE 


The Sample 


Pupils in all five samples were administered the 
IAV. Samples IV and VI were each tested three
times over a three-year period, while Samples VII,
VIII, and IX were each tested twice over a two-year
period. The approximate number of cases for each 
sample may be found in Table 11. 


The Measure 


The principal measure of self-concept employed 
in this research was the Index of Adjustment and 
Values, developed by Robert E. Bills (11). The In- 
dex of Adjustment and Values is currently available 
in four forms: 1) the elementary school form to be 
used in grades three, four and five, 2) the junior 
high school form for grades six, seven and eight, 
3) the high school form for grades nine, ten and 11, 
and 4) the original adult form which is also used in 
Grade 12. 

The high school form contains 37 trait words, the 
junior high school form contains 35 words, and the 
elementary school form contains 19. In all of these
forms the pupil answers three questions about himself:
he rates himself with respect to the trait word
(concept-of-self), tells how he feels about being this
sort of person (acceptance-of-self), and how he would
like to be with respect to each trait (ideal-self). In
addition to yielding these three scores, a fourth
score, Discrepancy, was obtained by subtracting the
concept of self score from the ideal self score. 3
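
The relationship among the four scores can be illustrated briefly. The sketch below assumes that each score is the sum of the pupil's ratings over the trait words, which the text does not state; the function name `iav_scores` and the ratings shown are hypothetical.

```python
def iav_scores(concept, acceptance, ideal):
    """Combine per-trait ratings into the four IAV scores.

    Each argument is a list of numeric ratings, one per trait word.
    Summing the ratings is an assumption made for this illustration.
    """
    if not (len(concept) == len(acceptance) == len(ideal)):
        raise ValueError("each trait word needs all three ratings")
    concept_of_self = sum(concept)
    acceptance_of_self = sum(acceptance)
    ideal_self = sum(ideal)
    return {
        "concept_of_self": concept_of_self,
        "acceptance_of_self": acceptance_of_self,
        "ideal_self": ideal_self,
        # Discrepancy = Ideal Self minus Concept of Self, as stated in the text.
        "discrepancy": ideal_self - concept_of_self,
    }


# Made-up ratings for three trait words (not study data).
scores = iav_scores(concept=[3, 4, 2], acceptance=[4, 4, 3], ideal=[5, 5, 4])
```

Under this definition a larger Discrepancy score means that the pupil's ideal lies further above his present view of himself.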

A total of 455 pupils in District R Sample IV completed
all three of the administrations of the IAV
scheduled for this sample. In order to get some further
insight into the relationship between the elementary
and junior high school forms of the measure and
some estimates of the stability of the IAV scores, 
the four IAV subscores earned by these pupils on the 
three administrations of the measure were correla- 
ted. The results may be found in Table 16. It will 
be noted that the correlations between the elementary 
school IAV administered at the fifth grade level and 
the junior high school IAV administered at the sixth 
and seventh grade levels were quite low, ranging 
from .09 between the fifth and seventh grade Ideal
Self scores to .28 between the fifth and sixth grade
Concept of Self scores. On the other hand, corre- 
lations between the sixth and seventh grade admin- 
istrations of the junior high school IAV ranged from 
.41 to .51. These data seem to indicate that the elementary
and junior high school levels of the test may
be measuring variables that are only roughly comparable.
The correlations between the sixth and
seventh grade administrations seem to indicate that
the junior high school form of the test has moderately
good long term stability.


TABLE 16

CORRELATIONS BETWEEN IAV SCORES ADMINISTERED IN FIFTH, SIXTH, AND SEVENTH GRADES
TO SAMPLE IV, DISTRICT R (N = 455)

               Concept of Self     Acceptance of Self
               6th      7th        6th
Fifth Grade    .28      .27        .25
Sixth Grade             .51
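
Stability coefficients of this kind are correlations between the same pupils' scores on two administrations of the measure. As a minimal sketch, assuming the product-moment (Pearson) coefficient was the statistic used (the text says only that the scores were correlated), the computation looks like this. The function name `pearson_r` and the scores are made up for illustration and are not study data.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dx = [a - mean_x for a in x]  # deviations from the mean
    dy = [b - mean_y for b in y]
    cross = sum(a * b for a, b in zip(dx, dy))
    ss_x = sum(a * a for a in dx)
    ss_y = sum(b * b for b in dy)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cross / math.sqrt(ss_x * ss_y)


# Made-up scores for five pupils on two administrations (not study data).
sixth_grade = [10, 12, 15, 11, 18]
seventh_grade = [11, 13, 14, 12, 19]
r = pearson_r(sixth_grade, seventh_grade)
```

A coefficient near 1 would mean that pupils kept nearly the same rank order from one testing to the next, while a coefficient near zero would mean little relation between the two sets of scores.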


RESULTS 


Sample IV 


Sample IV was administered the Bills Index of 
Adjustment and Values at the fifth, sixth, and seventh
grade levels. At the fifth grade level, the elementary
form was employed while the junior high
school form was administered at the sixth and sev- 
enth grade level. The Sample IV data provide us 
with our best information concerning the relation- 
ship at the elementary level between the treatments 
and the four scores obtained from the Index of Ad- 
justment and Values. All between-treatment data 
are summarized in Table 17. 

Concept of Self Data for Sample IV, during the 
first year of the project, revealed that the mean
scores of all Sample IV groups were very similar, 
varying less than two points from the lowest to the 
highest group mean. One significant difference, 
however, was found between treatments. District R
superior pupils scored significantly higher on Concept
of Self than pupils of comparable ability in District
A. Comparisons between pupils at the average
and slow levels in the two districts yielded no sig- 
nificant differences although the District R samples 
were slightly higher in both instances. Thus, we 
find that exposure to ability grouping for a little less 
than one school year seemed to have had little in- 
fluence upon the self-concept of the pupils involved 
when these scores were compared with the District 
R pupils who remained in random grouped classrooms.

Although following somewhat similar patterns, 
the Concept of Self scores obtained by Sample IV at 
the sixth-grade level yielded a number of statistically 
significant differences. District R pupils generally
scored higher than District A pupils on this measure.
In comparing groups of similar ability, we find
District R pupils were significantly higher than their
counterparts in District A in four of the six between-treatment
comparisons. Only the difference between
slow boys in the two districts failed to reach
or approach statistical significance.


TABLE 16 (continued)

               Acceptance of Self    Ideal Self        Discrepancy Score
               7th                   6th      7th      6th      7th
Fifth Grade    .20                   .15      .09      .16      .18
Sixth Grade    .46                            .46               .41

During the final year of the study, the patterns 
that had begun to emerge in the sixth grade data be- 
came clearer. Mean scores obtained during the 
seventh grade showed District R pupils to be signif- 
icantly higher than comparable pupils in District A 
in four of the six between-treatment comparisons.
Within-treatment differences also revealed a consistent
pattern in both districts, and for both sexes,
showing the superior pupils to have the highest Con- 
cept of Self scores, the average pupils the next high- 
est and the slow pupils, the lowest. 

In reviewing the Concept of Self data for Sample 
IV, several important findings seem to emerge. 
First, the very slight differences existing between 
the districts during the fifth grade became greater 
during the sixth grade and still greater at the
seventh grade level. This trend could indicate a
cumulative effect of the two treatments upon self-concept
brought about by an increased awareness of the
grouping. It would also suggest that pupils place a 
higher value upon their achievement status as they 
mature and move into the junior high school. It is 
interesting to note that the mean Concept of Self
scores of all groups went up between the sixth grade
and seventh grade testing, with the exception of
District A slow pupils. This exception may lend support
to the argument that ability grouping is destructive
to the self-concept of the slow pupil as it tends
to label or stigmatize him.


Another finding worthy of discussion in reviewing
the Sample IV Concept of Self data is the consistently
higher scores earned by District R pupils as compared
with District A pupils of comparable ability.
In every one of the 15 comparisons made between
pupils of comparable ability over the three administrations
of the IAV, District R pupils obtained the
higher mean score and in 11 of these comparisons,
this difference was statistically significant. The fact
that these differences were slight at the fifth-grade
level and fairly large at the seventh-grade level suggests
that they cannot be attributed to initial differences
in the pupils themselves but are more likely
due to the grouping treatments. Treatment effects
seem by far the most likely explanation as it will be
recalled that the pupils in each district came from
several different schools and were found to be reasonably
comparable on variables such as sociometric
status that could have had an effect upon Concept
of Self scores.


TABLE 17


SUMMARY OF SIGNIFICANT BETWEEN-TREATMENT DIFFERENCES ON THE IAV


Columns: Sample IV, Grades 5, 6, 7; Sample VI, Grades 7, 8, 9;
Sample VII, Grades 8, 9; Sample VIII, Grades 9, 10;
Sample IX, Grades 9, 11a.

Rows, listed under each of Concept of Self, Acceptance of
Self, Ideal Self, and Discrepancy Score c: Superior Boys,
Superior Girls, Average Boys, Average Girls, Slow Boys,
Slow Girls.

Cell entries, in the order extracted (their row and column
positions are not recoverable from this copy):
Rb; R R; R*; R R; R R R; R*; R* R; R R; R*; R R; R; R; R*;
R R; R R; R; A A*; A*; A; A; A; A*; A*; R*; A*; A*; R; R;
R R; R R; R; R*; R*; R; A*; A*; A*; A*

* Approached significance (t-value 1.65 to 1.95 inclusive).

a As data were not analysed separately by sex for this sample, a significant difference when found
is indicated for both boys and girls.

b An “R” indicates that the District R subgroup had a significantly higher mean score; “A” indicates
a significantly higher mean for the District A group.

c High Discrepancy Scores are considered undesirable.

Bills considers the Acceptance of Self score to 
be of major importance in interpreting the IAV. Patterns
on the Acceptance of Self scores were generally
similar to those found on the Concept of Self
scores. District R pupils generally received higher 
Acceptance of Self scores than District A pupils of 
comparable ability. Between-treatment compari- 
sons over the three replications revealed that all 
three of the first year differences were statistically 
significant beyond the .01 level of confidence in favor
of District R pupils. For the last two years, 12
between-treatment comparisons were made. Five 
of the six differences between boys’ subgroups were 
statistically significant, while none of the differ- 
ences between girls’ subgroups reached significance. 
It is interesting to speculate on the possible reasons 
for the greater between-treatment differences be- 
tween boys’ groups. 

Although the Discrepancy Score, obtained in this
study by subtracting the Concept of Self score from
the Ideal Self score, is the last score obtained in the
IAV sequence, we will treat it at this time because
of its apparent similarity to the Acceptance of Self
score. It would appear that the Discrepancy Score
measures self-acceptance somewhat less directly and
in a different manner than that employed by the
Acceptance of Self score itself. For the other scores
obtained, the higher scores are indicative of a more
favorable self-concept. The reverse is true, however,
for the Discrepancy Score, as a large score indicates
a great discrepancy between the way the individual
views himself and the ideal which he would
like to attain.

Two analyses were carried out on the Discrepancy
Score data. Examination of the results of the
first analysis showed that in the majority of cases
the low ability pupils had smaller Discrepancy
Scores, which would indicate better adjustment. As
these data seemed to indicate that slow pupils had
the best adjustment to their self-concept, and as this
part of the Discrepancy Score results did not agree
with the Acceptance of Self data, it was decided that
a careful recheck should be made of the IAV test papers
to see if some spurious element had entered
the situation to reduce the size of the Discrepancy
Score for low ability pupils. A number of cases
were found upon examination of Sample IV scores
that suggested that pupils had mechanically checked
the responses on the various columns of the IAV. In
other words, pupils had followed identical patterns
in checking responses on the three columns, which
usually suggests either a negative attitude concerning
the test or a lack of understanding concerning
the instructions. Other papers were found in which
pupils received a higher score in the Concept of Self
column than in the Ideal Self column, thus yielding a
negative Discrepancy Score. Bills recommends calculating
the Discrepancy Score by taking the difference
between the Column 1 and Column 3 entry, regardless
of which is larger. This procedure was
not followed as it seemed likely that persons whose
Discrepancy Scores were negative, i.e., Concept of
Self higher than Ideal Self, would not be comparable
to pupils with equal but positive Discrepancy Scores.
Most of the papers yielding negative Discrepancy
Scores seemed to indicate a misunderstanding of the
instructions, while a few appeared to identify pupils
who consciously reported themselves superior to
their own ideal self. The apparently spurious papers
were most often obtained from pupils of low ability
and, as such papers yielded a zero or negative
Discrepancy Score, they would tend to lower disproportionately
the mean Discrepancy Score for low ability
pupils. To remove these apparently spurious papers,
a set of criteria was developed for identifying them,
and the papers meeting these criteria were removed.
A second analysis of variance of the Discrepancy Score
data was then calculated.
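
The scoring and screening steps just described can be sketched in Python. This is a minimal illustration and not the study's procedure: the item ratings, the `Paper` structure, and the two screening rules (identical response patterns in all three columns, or a zero or negative Discrepancy Score) are assumptions made here, since the criteria actually used are not listed in the text.

```python
from dataclasses import dataclass
from statistics import mean
from typing import List


@dataclass
class Paper:
    """One hypothetical IAV answer sheet: item ratings for each column."""
    concept: List[int]     # Column 1: Concept of Self ratings
    acceptance: List[int]  # Column 2: Acceptance of Self ratings
    ideal: List[int]       # Column 3: Ideal Self ratings


def discrepancy(p: Paper) -> int:
    """Signed score as used in this study: Ideal Self minus Concept of Self."""
    return sum(p.ideal) - sum(p.concept)


def absolute_discrepancy(p: Paper) -> int:
    """The form attributed to Bills: the difference regardless of which is larger."""
    return abs(discrepancy(p))


def looks_spurious(p: Paper) -> bool:
    """Assumed screening rules, standing in for the study's unlisted criteria."""
    identical_columns = p.concept == p.acceptance == p.ideal
    return identical_columns or discrepancy(p) <= 0


# Invented ratings, for illustration only.
papers = [
    Paper([3, 4, 3, 4], [3, 3, 4, 4], [5, 5, 4, 5]),  # ideal above concept
    Paper([3, 3, 3, 3], [3, 3, 3, 3], [3, 3, 3, 3]),  # same pattern in every column
    Paper([5, 5, 5, 5], [4, 4, 4, 4], [4, 4, 4, 3]),  # concept above ideal
]
kept = [p for p in papers if not looks_spurious(p)]
mean_all = mean(discrepancy(p) for p in papers)
mean_kept = mean(discrepancy(p) for p in kept)
```

In this toy sample the zero and negative scores pull the group mean down, and removing the flagged papers raises it, which is the direction of distortion the text describes for the low ability groups.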

This analysis revealed that District R pupils re- 
ceived more desirable Discrepancy Scores than Dis- 
trict A pupils of comparable ability in all 15 com- 
parisons made in the three replications of the study. 
Ten of these 15 differences were statistically signifi- 
cant. Four of the five differences between superior 
pupils in the two districts reached statistical signif- 
icance. Significant differences occurred most fre- 
quently between superior pupils and least frequently 
in comparisons between slow pupils. This is prob- 
ably due in part to the small numbers of cases in the 
slow groups, as the actual differences between the 
mean Discrepancy Scores of the slow groups in the 
two districts were quite large except at the
fifth-grade level.

Again it was noted that the between-treatment dif- 
ferences became greater over the three year period 
of the study. Four of the six between-treatment dif- 
ferences during the third year were statistically sig- 
nificant and the t’s for each ability level were higher 
than the t’s for the same ability level during the 
first year. Thus, a differential effect favoring Dis- 
trict R seemed to be operating with respect to 
changes in the Discrepancy Score. Comparisons of 
the magnitude of these scores from year to year can 
best be made between the final two years as the Elementary
School IAV administered during the fifth
grade is shorter and yields scores that are not 
closely comparable to those obtained using the Jun- 
ior High School IAV in the sixth and seventh grades 
(see Table 17). 

The Ideal Self score on the Index of Adjustment
and Values calls for the individual to construct an
ideal based on the adjectives presented in the scale.
In other words, the pupil reports the type of person
he would like to be, thus reflecting his aspirations
in terms of future self. It was anticipated in this
project that the Ideal Self measure would be less
likely to be influenced by ability grouping or random
grouping than the other scores obtained from the
IAV. The mean differences between pupils of comparable
ability in the two districts were small during
all three years in which the IAV was administered.
No significant between-treatment differences
were found during the first year. Two significant
differences, between the two average groups,
were found in the subsequent year. It is interesting
to note that for both years two and three, District R
average boys obtained significantly higher
Ideal Self scores than comparable District A pupils
(see Table 17).


Self Concept Changes Accompanying Changes in 
Ability Level in District A 


In view of previous research that indicates most
pupils are aware of the ability level to which they
are assigned in ability grouped systems, it was hypothesized
that reassignment of District A pupils to
a lower ability section would be accompanied by
changes in the Concept of Self, Acceptance of Self,
and Discrepancy Scores on the IAV. In order to test
this hypothesis at the elementary school level, pupils
in Sample IV who had completed the IAV in both
the fifth and sixth grades were classified into three
groups: 1) pupils who were assigned to the same
ability levels in both grades, 2) pupils who were assigned
to a higher ability level in the sixth grade
than they had occupied in the fifth grade, and 3) pupils
who were assigned to a lower ability level in the
sixth grade than they had occupied in the fifth grade. It
may be seen in Table 18 that the vast majority of pupils
were assigned to the same level during the fifth
and sixth grade. A total of 26 pupils for whom complete
data were available moved to a higher ability
level, while 37 pupils moved to a lower level.

These numbers, although smaller than would be 
desirable, were considered large enough to justify 
a comparative analysis of the self-concept data. The 
fifth grade and sixth grade mean scores on the four 
IAV subtests earned by these three groups of pupils 
were compared using analysis of variance and then 
the sixth grade means were adjusted for fifth grade
differences using analysis of covariance to deter- 
mine whether significant changes in the self-concept 
scores had taken place. 
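
The covariance adjustment mentioned above can be illustrated with a short Python sketch. It assumes the usual textbook form of the adjustment, in which each group's sixth grade mean is corrected by the pooled within-group regression slope times that group's fifth grade deviation from the grand fifth grade mean. The scores and the function name `adjusted_means` are invented for illustration; they are not the study's data or its computing procedure.

```python
from statistics import mean


def adjusted_means(groups):
    """Adjust each group's post-test mean for pre-test differences.

    groups maps a label to a list of (pre, post) score pairs.
    The pooled within-group slope is b_w = sum(Sxy) / sum(Sxx), and
    adjusted mean = post_mean - b_w * (pre_mean - grand_pre_mean).
    """
    sxy = 0.0
    sxx = 0.0
    group_means = {}
    for label, pairs in groups.items():
        pre_mean = mean(x for x, _ in pairs)
        post_mean = mean(y for _, y in pairs)
        group_means[label] = (pre_mean, post_mean)
        sxy += sum((x - pre_mean) * (y - post_mean) for x, y in pairs)
        sxx += sum((x - pre_mean) ** 2 for x, _ in pairs)
    slope = sxy / sxx
    grand_pre = mean(x for pairs in groups.values() for x, _ in pairs)
    return {
        label: post_mean - slope * (pre_mean - grand_pre)
        for label, (pre_mean, post_mean) in group_means.items()
    }


# Invented (fifth grade, sixth grade) scores, for illustration only.
scores = {
    "same level": [(50, 52), (54, 56), (58, 60)],
    "higher level": [(52, 55), (56, 59), (60, 63)],
    "lower level": [(46, 45), (50, 49), (54, 53)],
}
adjusted = adjusted_means(scores)
```

A group that started lower has its final mean adjusted upward and a group that started higher has it adjusted downward, so that remaining differences among the adjusted means are not simply a carry-over of the initial differences.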


In reviewing the Concept of Self scores, it is 
found that groups one and two (pupils assigned to the
same or higher level in the sixth grade) obtained significantly
higher scores than did group three (pupils
assigned to a lower level) on the fifth grade administration
of the IAV. This difference, although statistically
significant at the .05 level, was small. At
the sixth grade level, group three scored signifi- 
cantly lower than the other two groups. 

These data indicate that pupils assigned to a low- 
er ability level in the sixth grade were significantly 
lower in Concept of Self both before and after this 
reassignment, but that the reassignment itself did 
not appear to lead to any marked deterioration of the 
individual’s Concept of Self scores. Thus, the lower 
Concept of Self score may have been a contributing 
cause to the reassignment of the pupil to a lower
ability level or may have resulted from initial mis- 
assignment of the pupil to a level where he could not 
compete. Little evidence is present in the Sample 
IV data, however, to indicate that assignment of the 
pupil to a lower ability level brings about a lowering 
of Concept of Self. 

The Acceptance of Self data show fifth grade
scores for the three groups to be very similar, with 
none of the differences between these initial scores 
reaching statistical significance. 

It was tentatively hypothesized that no changes in 
Ideal Self scores would take place for pupils as- 
signed to the same ability level in subsequent years 
but that reassignment to a lower ability level would 
tend to lower Ideal Self scores and assignment to 
a higher ability level would tend to raise Ideal Self 
scores. This hypothesis was not supported by the 
data. Both fifth grade and sixth grade group means 
on the Ideal Self score were very similar and none of 
the differences obtained approached statistical sig- 
nificance. 

The mean Discrepancy Score, essentially an in- 
direct measure of self acceptance, was expected to 
go up for group two and go down for group three on 
the sixth grade testing. Reference to Table 18 
shows a trend for group three pupils to have higher 
Discrepancy Scores on both the fifth and sixth grade
administrations of the IAV. The differences at the
fifth grade level were very slight, but the sixth
grade differences approached statistical significance. 


Sample VI


Pupils in Sample VI were administered the junior- 
high school form of the Index of Adjustment and Val- 
ues in the seventh, eighth, and ninth grades. The 
IAV results for this sample provide the best evidence
available in this study of the effects of ability group- 
ing upon self-concept at the junior high school level. 

Differences between Sample VI pupils on the Concept
of Self score on the IAV are summarized in Table 17.
For the seventh grade administration of the
IAV (when Sample VI pupils had been exposed to one
year of ability grouping or random grouping at the
sixth grade level and most of a year at the junior
high school level) two significant between-treatment
differences emerged. District R superior girls were
found to be significantly higher in Concept of Self
than District A superior girls, and District R average
girls were significantly higher than girls of comparable
ability in District A. At the eighth and ninth
grade testing, District R superior girls again made
higher scores than District A superior girls, but
these differences did not reach statistical significance.
In the case of average girls, however, the
District R sample was significantly higher than comparable
District A girls in all three administrations
of the junior high school Index of Adjustment and
Values. Of the nine comparisons between girls of
comparable ability in Districts A and R, District R
girls received higher Concept of Self scores in every
comparison. In contrast, there were no significant
between-treatment differences for comparable boys’
groups in District A versus District R. These data
suggest that differences in the two grouping treatments
had a somewhat greater effect upon girls than
upon boys with regard to Concept of Self scores.

Scores on the Acceptance of Self section of the
IAV again seem to reflect the greater impact of the
grouping treatments on girls’ subgroups. No significant
between-treatment differences were found in
the nine comparisons between District R and District A
boys of comparable ability. Of the nine comparisons
made between comparable girls’ groups in the two
districts, all favored District R pupils. Two of these
differences were statistically significant and three
more approached significance.

It will be recalled that Discrepancy Scores in 
Sample IV were reanalyzed after removing scores 
that, upon examination of the answer sheets, appeared
to be spurious. Such answer sheets were removed
from the Sample VI data prior to the start of
the analysis so that scores for these cases were not 
included in any of the IAV subtests for Sample VI. 

For Sample IV, rather large and consistent Dis- 
crepancy Score differences favoring District R were 
found in the between-treatment analysis. The be- 
tween-treatment differences in Sample VI generally 
followed the same pattern but were much smaller 
with only two differences, both between average 
girls in the two districts, being statistically signif- 
icant. 

Between-treatment differences reaching or approaching
significance increased during the final
year of the study. Even in this final year, however, 
these differences were considerably less marked 
than were found in Sample IV, suggesting that the 
grouping treatment at the junior high school level 
had somewhat less of an effect upon Discrepancy 
Scores than the more extensive differences that 
were present at the elementary school level. 

Results for Sample VI on the IAV Ideal Self score
are also summarized in Table 17. Of the nine
between-treatment comparisons for boys, only one,
favoring District R slow boys, was statistically significant.
A somewhat more definite pattern emerged,
however, in the between-treatment differences for
girls. In seven of the nine comparisons, the District R
girls scored higher, with two of these differences
reaching statistical significance.


Self-Concept Changes Accompanying Ability Level 
Changes in District A 


In order to determine whether the four self-con- 
cept variables were affected by changes in the pu- 
pil’s ability level in ability grouped classes, Sample 
VI pupils were classified into six groups: boys and 
girls assigned to the same ability level in the sixth 
and seventh grades, boys and girls assigned to a
higher ability level in the seventh grade, and boys
and girls assigned to a lower ability level in the seventh
grade. Analysis of covariance was carried out
for each of the four self-concept variables to deter- 
mine whether the final adjusted mean scores on 
these variables were significantly different for like- 
sex pupils whose ability classification remained the 
same, went up, or went down. The results of this
analysis may be found in Table 19. As was the case
with Sample IV, the majority of pupils were assigned 
to the same ability levels in the sixth and seventh 
grades. A sufficient number of both boys and girls, 
however, were assigned either higher or lower abil- 
ity levels to permit an analysis of these data. 

In comparing the Concept of Self scores for the
sixth grade administration of the IAV, it was found
that boys assigned the same level (group 1) were significantly
higher than boys who were subsequently
assigned to a higher level or a lower level. On the 
seventh grade administration, however, these initial 
differences had largely disappeared. 

In contrast, the three girls’ groups were closely 
comparable on the sixth grade Concept of Self score 
but differences had emerged by the seventh-grade 
administration. Comparison of the adjusted seventh- 
grade means of girls’ groups on the Concept of Self 
score revealed that girls assigned the same ability 
level in both the sixth and seventh grades (group 2) 
had higher mean scores than both the girls assigned 
to a higher level (group 4) and the girls assigned to 
a lower level in the seventh grade (group 6). This 
latter difference was significant at the .01 level, and
would suggest that in an ability grouped situation,
girls find remaining in the same ability level less
threatening to self-concept than either moving to a 
higher level or a lower level, with movement to a 
lower level having the least favorable effect upon 
their scores. 

Scores on Acceptance of Self showed that group 1
boys again had a somewhat higher mean score than
either group 3 or group 5 boys, but these differences
were not statistically significant. At the seventh-grade
testing, we find that the three boys’ groups
had become more closely comparable on this score.
A comparison of the adjusted seventh grade means
for the girls’ groups showed both group 2 and group
4 to have significantly higher adjusted Acceptance
of Self means than group 6. Thus, it appears that
for girls, being assigned to a higher ability level did
not result in a significant increase in Acceptance of
Self scores, but assignment to a lower level brought
about a relative decline in these scores. These data
generally support the Sample IV findings.

Initial Ideal Self scores showed the boys who were
to remain at the same level to be significantly higher
than those who were to be assigned to a higher or
lower ability level during the subsequent year. The
group 1 boys were significantly higher than group 3
on the final adjusted Ideal Self mean. Group 3 and
group 5 boys were not significantly different on the
final adjusted mean. Comparisons between the
three girls’ groups showed them to be quite similar
in their sixth grade mean scores with somewhat
larger differences occurring in the seventh grade
means. The final adjusted mean score for group 2
was significantly higher than that for group 6 on the
Ideal Self score. It is interesting to note that although
the final adjusted means for group 4 and
group 6 girls were not significantly different, the
girls who were assigned a higher ability level experienced
an increase of about two points between
the sixth and seventh grade tests while the girls who
were assigned a lower level in the seventh grade experienced
a decrease of two points on Ideal Self.
These data suggest that for the girls’ groups, assignment
to a lower ability level tends to lower the
individual’s Ideal Self score.

No significant differences in Discrepancy Scores 
were found among pupils who moved up, down, or 
remained the same in ability level classification be- 
tween the sixth and seventh grades. All groups had 
slightly lower Discrepancy Scores at the seventh 
grade level. 


Sample VII 


In reviewing the between-treatment differences
in Concept of Self for Sample VII, we find District R
superior girls have significantly higher Concept of
Self scores than District A superior girls for the
eighth grade testing (see Table 17). Other between-treatment
differences were not statistically significant
although three differences approached statistical
significance, all favoring District R girls. When
one considers that only one between-treatment difference
reached statistical significance at the .05
level out of the 12 comparisons made, it seems possible
that this single significant difference is due to
chance. The fact, however, that the three differences
which approached significance also favored
District R, suggests that if any self-concept difference
did exist, it favored the random grouped sample.
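
The chance argument in this paragraph can be made concrete with a little arithmetic. The sketch below assumes, for illustration only, that the 12 comparisons are independent tests at the .05 level with no true difference present. The text does not claim independence, and comparisons made on the same pupils in successive years would not be independent, so the figures are a rough guide rather than a result of the study.

```python
def prob_at_least_one(n_tests: int, alpha: float = 0.05) -> float:
    """Chance of one or more significant results among n independent null tests."""
    return 1.0 - (1.0 - alpha) ** n_tests


def expected_false_positives(n_tests: int, alpha: float = 0.05) -> float:
    """Average number of significant results expected by chance alone."""
    return n_tests * alpha


p_any = prob_at_least_one(12)
expected = expected_false_positives(12)
```

Under these assumptions the probability of at least one significant difference in 12 comparisons is roughly 0.46, and the expected number is 0.6, which is why a single significant result carries little weight by itself.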


Between-treatment differences on the Acceptance
of Self score for Sample VII revealed three signifi- 
cant differences, all favoring District R girls as 
compared with District A girls. For both adminis- 
trations of the IAV, District R superior girls were 
significantly higher than District A superior girls. 
During the 1960-61 year, District R average girls 
were significantly higher than District A average
girls. The relatively small number of slow girls in 
the two samples led to a considerably larger stand- 
ard error of the difference between means for these 
groups, thus requiring greater differences to reach 
statistical significance. A rather steady trend ap- 
peared to be present in all samples for District R 
slow girls to score higher on Acceptance of Self 
than District A slow girls. These results suggest 
that a true difference is present although this difference
reaches statistical significance in only about
half of the replications. 
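
The point about small slow groups can be illustrated numerically. The sketch below uses the standard formula for the standard error of the difference between two independent means and an assumed large-sample critical ratio of 1.96. The standard deviations, group sizes, and function names are invented for illustration and are not taken from the study.

```python
import math


def se_difference(sd1: float, n1: int, sd2: float, n2: int) -> float:
    """Standard error of the difference between two independent means."""
    return math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)


def minimum_significant_difference(sd1, n1, sd2, n2, critical=1.96):
    """Smallest mean difference whose ratio to the standard error reaches `critical`."""
    return critical * se_difference(sd1, n1, sd2, n2)


# Invented values: equal spread, but far fewer pupils in the slow groups.
needed_average = minimum_significant_difference(10.0, 100, 10.0, 100)
needed_slow = minimum_significant_difference(10.0, 16, 10.0, 16)
```

With the same spread of scores, groups of 16 require a mean difference two and a half times as large as groups of 100 before the difference reaches significance.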

Comparisons between the mean Discrepancy 
Scores of subgroups in the two districts revealed no
statistically significant between-treatment differ- 
ences. Between-treatment comparisons for Sample 
VII on the Ideal Self score revealed only one statis- 
tically significant difference favoring District R slow 
girls. 


Sample VIII 


Only two significant between-treatment differ- 
ences occurred in the Sample VIII Concept of Self 
scores. During the ninth grade, District R average 
boys scored significantly higher than District A 
average boys. During the tenth grade, District A 
superior boys scored significantly higher than boys 
of comparable ability in District R. 

Comparison of Acceptance of Self means revealed 
only one statistically significant between-treatment 
difference for each of the two years in which the IAV
was administered to this sample. District R supe- 
rior girls were significantly higher during the ninth- 
grade year, and District R average boys were sig- 
nificantly higher during the tenth grade year. 

Only one significant between-treatment difference 
on Discrepancy Score means was obtained for this 
sample. District A superior girls had a higher mean
than comparable District R girls. No significant between-treatment
differences were found on the Ideal
Self scores. The presence of only one significant
difference in the tenth grade data suggests that at the 
high school level the somewhat changed situation 
brought about by the wide range of electives available 
in both districts and less distinct ability groupings in 
District A, probably reduced the importance of the 
grouping treatments in terms of their influence upon 
pupils’ self-concept.


Sample IX


The Index of Adjustment and Values was administered
to Sample IX pupils during their ninth and
eleventh grade years. Analysis procedures for this
sample followed the same pattern employed with
previous samples excepting that the groups were
not divided by sex for the analysis of the eleventh
grade data.

Between-treatment comparisons of Concept of
Self scores for Sample IX are found in Table 17.
Only one significant between-treatment difference
was found, ninth grade average District R girls
scoring higher than comparable girls in District A.
No statistically significant differences were found in
the between-treatment comparisons of Sample IX
pupils on the Acceptance of Self score. Between-treatment
comparisons of the Discrepancy Score
showed a consistent tendency for District A pupils
to obtain higher (less favorable) scores than comparable
pupils in District R. Only the difference between
superior girls reached statistical significance
but three other differences approached significance.
Only one significant between-treatment difference
was found in Ideal Self, superior pupils in District
A scoring higher than those in District R.

In reviewing the data for Sample IX, we find that 
the ninth grade results are generally in line with 
those found in other samples. The almost complete 
lack of significant differences between IAV scores
of different subgroups at the eleventh grade level is 
probably caused by the small number of cases and 
the fact that the actual differences between the ability
grouping and random treatments are much smaller
at this level than in the junior high school and elementary
grades. Similar results were found for the
Sample VIII high school data. 


SUMMARY AND CONCLUSIONS 


All five samples in the Utah study were adminis- 
tered the IAV. Samples IV and VI were tested three 
times over a three year period, while Samples VII, 
VIII, and IX were tested twice over a two year
period. 

Between-treatment differences on the four scores
obtained from Bills’ Index of Adjustment and Values 
are summarized in Table 17. For each of the four 
IAV variables, 66 between-treatment comparisons
were made, 22 at each ability level. The largest
number of between-treatment comparisons, 18,
was made for Sample VI pupils. Fifteen comparisons
were made for Sample IV, 12 each for Samples VII
and VIII, and nine for Sample IX. Data collected
from the sixth through the ninth grade levels probably
provide the most meaningful results on self-concept.
Differences between fifth grade pupils
were based on the elementary school form of the 
IAV which is shorter and somewhat less reliable 
than the junior high school form. Differences be- 
tween pupils in the high school are less meaningful 
because of the less clear-cut distinction between the 
two grouping treatments at that level. Except for
data collected on fifth grade pupils in Sample IV,
and eleventh grade pupils in Sample IX, boys and
girls were placed in separate groups for the IAV
analyses.

Of the 66 between-treatment comparisons on the
Concept of Self score, 23 reached or approached statistical
significance. All but two of these 23 differences
were in a direction favoring District R pupils.
Eight of these differences were between superior pupils
in the two districts, ten between average pupils,
and five between slow pupils. It is interesting to
note in the Concept of Self analysis that 15 of the 23
differences reaching or approaching statistical significance
were between girls’ groups while only seven
of these differences were between boys’ groups.
One of the significant differences, for Sample IV,
was based on groups not divided by sex. The extent
and consistency of the Concept of Self data seem to
justify the conclusion that pupils in random grouped
classrooms tended to obtain more favorable Concept
of Self scores than comparable pupils in ability
grouped classrooms, at all ability levels. The data
also appear to support the conclusion that the two
grouping treatments had a somewhat greater effect
upon girls than boys. These data may reflect the
greater preoccupation of girls with achievement and
academic success. The results generally suggest
that placement in an ability group tends to adversely
affect girls’ self-concept. This adverse effect is of
least importance for pupils at the lower ability level.
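
The counts reported so far in this summary can be cross-checked with simple arithmetic. The sketch below uses only figures stated in the text; the variable names are chosen here for illustration, and the number of differences between girls' groups is derived from the other figures rather than entered directly.

```python
# Between-treatment comparisons per IAV variable, by sample, as stated in the text.
comparisons = {"IV": 15, "VI": 18, "VII": 12, "VIII": 12, "IX": 9}
total = sum(comparisons.values())
per_ability_level = total // 3  # superior, average, and slow levels

# Concept of Self differences reaching or approaching significance, by ability level.
by_ability = {"superior": 8, "average": 10, "slow": 5}
total_flagged = sum(by_ability.values())

# Seven were between boys' groups and one came from groups not divided by sex,
# so the remainder must have been between girls' groups.
boys, undivided = 7, 1
girls = total_flagged - boys - undivided
```

The sample counts sum to the 66 comparisons and 22 per ability level given in the text, and the ability-level breakdown sums to the 23 differences reported for Concept of Self.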

Of the 66 between-treatment comparisons of IAV 
Acceptance of Self mean scores, 15 were statisti- 
cally significant and seven others approached signif- 
icance. These results reflected essentially the 
same trend found in analysis of the Concept of Self 
data. Six of the significant differences were between 
superior pupils, six between average pupils, and 
three between slow pupils in the two districts. All 
15 of these significant between-treatment differ- 
ences favored District R as did four of the seven dif- 
ferences that approached significance. It will be 
noted that the significant differences occurred be- 
tween girls’ groups and boys’ groups with about 
equal frequency. When we consider all results rel- 
ative to acceptance of self obtained in this study, a 
consistent pattern favoring District R pupils
emerges. This pattern is most pronounced for su- 
perior and average pupils but is present to a lesser 
degree for slow pupils. Based on these data it must
be concluded that random grouping is consistently 
related to higher self acceptance for pupils at all 
ability levels and over most of the grade levels cov- 
ered in this project. 

The overall summary of between-treatment com- 
parisons on the IAV Discrepancy Score shows 14 of 
the 66 differences to be statistically significant. Six 
of these differences were between superior pupils in 
the two districts. Six were between average pupils 
and two were between the slow pupils. All but one 
of the 14 significant differences showed higher (less 
favorable) Discrepancy Score means for District A
subgroups. In addition to these significant differ- 
ences, eight differences approached statistical sig- 
nificance, and all of these showed higher mean Dis- 
crepancy Scores for District A groups. Significant 
differences occurred with about equal frequency for 
boys’ and girls’ groups. It may be concluded that 
the two grouping treatments had a differential effect
upon the pupils’ Discrepancy Scores with ability
grouping generally associated with less favorable
scores. This pattern is most pronounced in the differences
between the mean scores of superior pupils
in the two treatments, closely followed by average
pupils. The differences between slow pupils are
substantially smaller.

Of the 66 between-treatment comparisons of 
mean Ideal Self scores on the Index of Adjustment 
and Values, eight reached statistical significance. 
Seven of these eight differences favored District R 
pupils, with two occurring between superior groups,
four between average groups and two between slow 
groups. It will be noted in Table 17 that the few sig-
nificant differences on the Ideal Self scores are gen- 
erally scattered. Only in the comparisons between 
average girls do we find significant differences oc- 
curring in more than one sample. From these data 
we may conclude that a slight trend favoring District
R pupils is in evidence on the Ideal Self scores. 
This trend, producing relatively few significant re- 
sults, suggests that the grouping system has little 
or no influence upon the Ideal Self score. 
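
The counts of significant differences reported above can be weighed against what chance alone would produce. The following is a rough sketch only, under two assumptions that are not stated in this section: that each comparison was tested at the .05 level, and that the 66 comparisons are independent (they are not strictly so, since the same pupils contribute to several scores). The name binom_tail is illustrative.

```python
from math import comb


def binom_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


N_COMPARISONS = 66  # number of between-treatment comparisons, from the text
ALPHA = 0.05        # ASSUMED significance level; not given in this section

# Number of "significant" results expected if no true differences existed.
expected_by_chance = N_COMPARISONS * ALPHA

# Counts of significant differences reported in the text for each IAV score.
reported = {
    "Acceptance of Self": 15,
    "Discrepancy Score": 14,
    "Ideal Self": 8,
}

print(f"expected by chance: {expected_by_chance:.1f} of {N_COMPARISONS}")
for score, count in reported.items():
    tail = binom_tail(N_COMPARISONS, count, ALPHA)
    print(f"{score}: {count} significant, chance probability of that many or more = {tail:.2g}")
```

Under these assumptions about three of the 66 comparisons would be expected to reach significance by chance, and the chance probability shrinks rapidly as the count rises from the eight obtained on Ideal Self to the 14 or 15 obtained on the other scores.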


Self-Concept Changes Accompanying Ability Level 
Changes in District A 


Pupils in District A, Sample IV who changed abil- 
ity levels between the fifth and sixth grade administrations
of the IAV and pupils in Sample VI who
changed ability levels between the sixth and seventh- 
grade administrations of the IAV were compared to 
determine whether changes in ability classification 
would be accompanied by changes in the self-concept
variables. The results of these analyses may be 
found in Tables 18 and 19. The reader is reminded 
that the Sample VI data are generally more meaning- 
ful because sufficient cases were available to divide 
the subgroups by sex, and because the junior high 
school form of the IAV was administered at both
sixth and seventh grade levels, yielding more comparable
data than were obtained for Sample IV.
From these analyses, it may be concluded that: 

1. Placement of a pupil in a lower ability classification
appears to bring about lower scores in Concept
of Self, Acceptance of Self, and Ideal Self.

2. The IAV scores of girls assigned to a lower
ability group appear to be more drastically affected 
than are the scores of boys under the same circum- 
stances. This may reflect the tendency for girls to 
place a higher value on academic achievement and 
status. 

3. Pupils destined to be reassigned to a lower
ability classification show lower scores in Concept 
of Self and Acceptance of Self prior to reassignment. 
This lower initial score may indicate that misassignment
of a pupil to a higher group than his ability
would justify brings about a lowering of his Concept 
of Self and Acceptance of Self because of his inabil- 
ity to perform satisfactorily in his assigned group. 

4. Remaining in the same ability classification
in an ability grouping system appears to be less
threatening to the pupil’s self-concept than reclassification
to either a higher or lower section. Reclassification
to a higher section may lead to a slight
loss in the pupil’s self-concept because he is prob- 
ably near the bottom of the ability range in his new 
group. This self-concept loss may be compensated 
to some extent by the pupil’s identification with the 
higher ability classification. The effect upon pupils 
assigned to a lower ability group, however, appears 
to be much greater. These pupils probably lose 
self-concept because of their initial misassignment 
and then their self-concept is further lowered,
especially for girls, when they are confronted with
the stigma of identification with a lower ability group.

Whether the less favorable scores on the self- 
concept variables that occurred rather consistently 
in the ability grouped samples are considered 
“good” or “bad” depends on one’s point of view.
When considered in the light of psychological re- 
search in related areas such as aspiration level, 
however, the author must conclude that the apparent 
effects of ability grouping upon the self-concept variables
that emerged from the Utah study are prob-
ably harmful to the development of at least some of 
the pupils who are educated under such a system of 
grouping. 
PERSONALITY


ALTHOUGH many of the arguments heard among
educators on the merits and demerits of ability
grouping imply that this practice can have an effect
upon the personality development of pupils involved,
very little direct research evidence comparing the
personality characteristics of pupils in ability and
random grouping systems has been published up to
the time of this writing. Subjective appraisal of the
differences that appear to be present in ability
grouped and random grouped classes, along with the
consideration of the limited evidence on the relative
merits of these systems, suggests possible personality
differences that might occur as a result of differences
in grouping treatment.

It might seem a simple matter to identify a test 
measuring the characteristics that we hypothesize 
are affected by the different grouping treatments, 
administer the test to pupils in the two treatments 
and determine whether differences between the 
scores are statistically significant. This is essentially
the approach used in this project, but the
reader should be aware of the many limitations at- 
tending this approach in the area of personality 
measurement. First, the greatest differences be- 
tween the grouping systems studied in this research
were at the elementary school level where the self- 
contained classroom system resulted in placing District
A pupils in a single ability level classification
for the entire school day. Unfortunately, however, 
at this level very few personality measures are 
available, and those available can be regarded at 
best as very crude instruments. At the junior high
school level, a greater number of personality measures
are available but it will be recalled that, as
District A employed grouping only in mathematics 
and science at this level, the treatment differences 
were much less marked than at the elementary lev- 
el and consequently would be expected to bring about 
smaller differences in personality. 

Another practical problem arising in the study of
personality differences between pupils in the two 
grouping treatments is the question of how much 
change in personality we might expect to take place 
as a result of changing a fairly small portion of the 
individual’s total experience. Let us suppose, for 
example, that one grouping treatment did tend more 
frequently than the other to develop inferiority feel- 
ings. It would seem a strong possibility that the 
many other experiences in the pupil’s day that may 
structure personality and that cannot adequately be 
controlled could mask any such personality differ- 
ences that resulted from the grouping treatment. 


A related question that must be faced is whether 
or not personality changes that might be brought 
about by the two grouping treatments could be ex- 
pected to become apparent in a period of one or two 
years. The lack of any basis for answering this
question, of course, leaves us with the possibility 
that the generally negative results obtained could 
reflect an inadequate exposure of the subjects to the 
treatment differences. 


Another factor to be considered is the generally 
low level of our knowledge concerning the nature of 
personality and its measurement. Evidence of valid- 
ity is rather scant for most personality measures, 
and even when validity data are given, these data usually
are based upon correlations with clinical judgement
or with other criteria that themselves are of
questionable validity. 


Some psychologists, though condemning the per- 
sonality inventory, place great confidence in pro- 
jective tests as a source of personality information. 
This project employed both group administered per- 
sonality inventories and individual and group admin- 
istered projective measures. The unbiased re- 
searcher, however, must conclude that neither the 
personality inventory nor the projective measure at 
its current state of development provides very solid 
evidence. Thus, the personality findings presented 
in this section should be considered at best tentative. 


WORKING HYPOTHESES 


Working hypotheses were established and several 
personality measures were studied in order to iden- 
tify variables that appeared to measure the person- 
ality characteristics that might be influenced by the 
two grouping treatments. Variables from four personality
inventories were employed: the California
Test of Personality (CTP), Inventory of Factors
GAMIN (GAMIN), Objective Analytic Personality
Tests (OA), and the California Psychological Inventory
(CPI). A projective measure employing cards
from the Thematic Apperception Test (TAT) and the 
Michigan Picture Test (MPT) was also used to meas- 
ure three variables that seemed particularly im- 
portant. 

Previous research evidence provided little information
or help in establishing working hypotheses; the hypotheses
are therefore quite tentative.

1. For the first variable, Feeling of Belonging
(CTP), it was hypothesized that pupils at both the superior
and slow ability levels would obtain more
favorable scores in ability grouped classrooms. It
would appear that these pupils would experience a
greater feeling of belonging in the less heterogene- 
ous District A groups. Slow pupils, especially, 
would seem more likely to feel secure in a group of 
other slow pupils. A gradual deterioration of this 
characteristic for pupils in slow groups might occur, 
however, outside of the classroom. Ability group- 
ing may cause both bright and slow children to sense 
that they are “different”. For the slow child this
difference carries with it the label of inferiority that 
is implied by his membership in the low ability 
class. Thus, the very act of separating pupils into 
groups on the basis of ability might make them actively
aware of differences which could in turn lower
their “feeling of belonging”.

2. It was hypothesized that low ability pupils 
would display significantly higher scores on With- 
drawing Tendencies (CTP) and Anti-Social Tenden- 
cies (CTP) in the ability grouped situation. Both 
seemed likely to be reactions to frustration. Evi- 
dence obtained in Drews’ study (31) that showed 
much higher levels of class participation for slow 
ability grouped students would suggest that such stu- 
dents would be less likely to experience frustration 
in ability grouped classes. Studies by Jackson (63) 
and Marsh (61) also reported fewer anti-social be- 
havior patterns in ability grouped situations. Math- 
ias’ work (81) which was discussed in the review of 
literature, would also suggest that less aggression 
would occur in ability grouped classes. 

3. Our working hypothesis with regard to aver- 
age pupils on the aforementioned CTP variables and 
to superior pupils on the latter two was that no sig- 
nificant between-treatment differences would occur. 

4. It was hypothesized that scores on ascendancy
in social situations (GAMIN) would be higher for superior
pupils in District R because of the greater
opportunities for these pupils to obtain positions of 
social leadership in random grouped classrooms. 
Slow pupils, on the other hand, would appear to have 
a greater opportunity for social leadership in the 
ability grouped classroom. 

5. Thus, it was hypothesized that slow pupils in
District A would obtain higher social ascendancy
scores than comparable pupils in District R.

6. It was hypothesized that District A slow pu- 
pils would receive less favorable scores on a meas- 
ure of inferiority feelings (GAMIN) than comparable 
pupils in District R. 

7. It was also hypothesized that superior District
A pupils would receive less favorable “inferiority”
scores than District R pupils, although this
difference might be expected to be less pronounced 
than the difference between the slow groups. For 
the slow child, many educators have suggested that 
the labeling that accompanies ability grouping would 
tend to lower self-confidence and develop inferiori- 
ty feelings. Some rather limited research evidence 
supports this contention (72). The superior pupil
might also be expected to have greater self-confidence
in the random grouped class because of the
smaller number of pupils in the class able to compete
with him academically and the greater chance
he has of obtaining a position of social recognition
within the peer group.

8. It was hypothesized that superior pupils in 
District R would obtain higher scores on Competent 
Assertiveness (OA) than superior pupils in District 
A. Cattell (25), in discussing this factor, suggests
that it is probably quite similar to achievement mo- 
tivation as studied by McClelland. The random 
grouping situation would seem more likely to devel- 
op the self perceptions underlying this factor in su- 
perior pupils than would the ability grouping situa- 
tion. 

Anxiety to Achieve (OA) although corresponding 
to some degree to the clinical concept of anxiety, 
was found by Cattell (25) to be loaded also with ver- 
bal ability, and scholastic achievement (holding in- 
telligence constant). It was hypothesized that supe- 
rior pupils in District A and slow pupils in District 
R would be higher on this variable. Both of these 
groups are subjected to considerable pressure to 
achieve and thus might be expected to display high 
levels of achievement-related anxiety. 

Working hypotheses for the California Psycholog- 
ical Inventory were developed for three of the four 
classes of variables measured by this instrument. 
Class I includes six measures in the broad area of 
poise, ascendancy and self-assurance. 

9. For the Class I variables, it was hypothesized
that slow pupils would score higher in the ability 
grouped classrooms, superior pupils would score 
higher in the random grouped classrooms and no dif- 
ferences would be found for average pupils. The 
ability grouping situation seems to provide the low 
ability pupil considerably more opportunity for de- 
veloping some of the Class I characteristics. Dom- 
inance, for example, seems to have a much greater 
chance of emerging in a situation where the slow pupil
is participating with peers of similar ability. Differences
in grouping treatments might also be expected
to have some influence on the slow pupil’s
Capacity for Status, Self Acceptance, and Sociability
and should contribute directly to his Social Presence.
This is supported by Drews’ research (31) which in- 
dicated that the slow pupil has more opportunity to 
experience successful social interaction in the abil- 
ity grouped situation. 

Ability grouping, on the other hand, appears to 
create an unfavorable climate for superior pupils to 
develop the Class I personality traits. When removed
from their favored position in the random
grouped classroom, it would appear that the superior
pupil’s opportunities to develop Dominance, Social
Presence, and Sense of Well Being would be
reduced. Sociability might similarly be reduced for
the superior pupil in the ability grouped situation as
it would seem likely that he would be less secure,
less able to dominate the social situation and less
able to gain social experience in the more homogeneous
class.

Three of the scales on the CPI make up Class
III, the measures of achievement potential and intellectual
efficiency.

10. For two of these measures, Achievement via
conformance (Ac) and Intellectual efficiency (Ie), it
was hypothesized that superior pupils in ability
grouped classrooms would receive higher scores.
Ability grouping with acceleration, where pupils are
exposed to a uniform program at a rate close to their
capacity, would seem to develop these two characteristics
while discouraging the third Class III variable,
Achievement via independence (Ai).

11. Thus, it was hypothesized that on the latter
variable superior pupils in random grouped classrooms
would obtain higher scores. Slow pupils in District
A were also hypothesized to obtain higher Ac scores
than their District R counterparts. No other be- 
tween-treatment differences were hypothesized for 
the Class III variables. 

Class IV measures are described by the test author
as those dealing with intellectual and interest
modes. A working hypothesis was developed for on- 
ly one of the three Class IV measures, Flexibility. 

12. It was hypothesized that the District R classroom
environment with enrichment would tend to develop
flexibility to a greater degree at all ability
levels than would the ability grouped classroom with 
acceleration. No working hypotheses were devel- 
oped for the six measures of socialization, maturi- 
ty and responsibility which make up the Class II 
variables on the CPI. 


Projective Data 


In addition to obtaining data in the Utah study on 
self-report personality inventories, two aspects of 
the study were concerned with the measurement of 
specific personality variables using a projective approach.¹⁴
The first of these studies employed an individually
administered measure consisting of eight
pictures selected from the Thematic Apperception
Test (TAT) (87) and the Michigan Picture Test
(MPT) (83). This study was concerned primarily
with comparisons between scores of aggression, depression,
and inferiority feelings obtained from slow
fifth grade pupils in Districts A and R, and will be
referred to as the fifth grade study. The second
study adapted substantially the same projective
measure for group administration. This study involved
sixth grade pupils at all three ability levels.
Mathias (81) found that in heterogeneous classrooms,
superior and slow learners showed more
signs of aggression and depression than did average
pupils. Although his study contained no comparable
data on ability grouped children, it had implications
for ability grouping as he concluded that superior
and slow learning children showed more aggression
and depression than the average children because
the educational program was not designed for them.
This would imply that in an ability grouped situation
where the program was more closely adapted to the
pupil’s ability level, less aggression and depression
would be found.

Luchins and Luchins (72) reported that children 
in ability grouped classes felt stigmatized by being 
placed in the low ability group. The inference here 
is that feelings of inferiority were increased by abil- 
ity grouping. However, this study lacked a compa- 
rable group from heterogeneous or random grouped 
classrooms so it is possible that similar feelings of 
inferiority may have developed in low ability pupils 
in random grouped classes, although perhaps for 
different reasons. 

The following working hypotheses refer to the
fifth and/or sixth grade studies as indicated. It 
was hypothesized that: 

1. More evidence of aggression would be found 
in slow learning pupils who were randomly grouped 
than in slow learning pupils who were placed in abil- 
ity groups (fifth and sixth grade studies). 

2. There would be more evidence of depression
in slow learning pupils who were randomly grouped
than in slow learning pupils who were ability
grouped (fifth and sixth grade studies).

3. There would be more evidence of inferiority 
feelings in slow learning pupils who were ability 
grouped than would be found in slow learning pupils 
who were randomly grouped (fifth and sixth grade 
studies). 

4. Superior pupils in random grouped class- 
rooms would obtain higher (less favorable) mean 
scores on aggression and depression than similar
pupils in ability grouped classrooms (sixth grade 
study). 


PROCEDURE 


Parts of the California Test of Personality (CTP),
elementary level, were administered to Sample IV
pupils during their fifth grade year. This test is
a fairly direct self-report type of personality inventory.
Two subtests, Feeling of Belonging and Withdrawing
Tendencies, from the Personal Adjustment
area, and one subtest, Antisocial Tendencies, from
the Social Adjustment area, were used to test hypotheses
in this study. Reliability coefficients for the
subtests of the California Test of Personality, elementary
level, that were employed in this research
ranged from .77 to .83.

As is the case with most direct self-report meas- 
ures, the items themselves present some evidence 
of content validity. Such measures, however, are 
dependent upon the accuracy of the subject’s insight 
and his willingness to provide honest answers. In 
the Utah study this measure was administered in a
permissive atmosphere and pupils were given no
motive for falsifying or faking their responses. In
the more important area of concurrent validity, little
evidence is available on this measure.

The Guilford-Martin Inventory of Factors GAMIN
provides scores on five personality variables developed
through factor analyses by Guilford and his associates
(84). Two of the variables on this measure
purport to test characteristics that educators have 
often suggested were influenced by ability grouping. 
These are Factor A, which measures ascendancy in 
social situations as opposed to submissiveness and 
Factor I, which measures lack of inferiority feel- 
ings. Items measuring these two factors were ad- 
ministered to Sample VI pupils during their seventh- 
grade year, Sample VII pupils during their eighth- 
grade year, and Sample VIII pupils during their 
ninth-grade year. In each case, District A pupils 
had been exposed to the ability grouping treatment 
for a little more than one and one-half school years.
Although this measure is usually not administered 
below grade nine, the reading level and content of 
the items appear generally appropriate for pupils as 
low as grade seven. Factor A contains 38 items
and the split-half reliability of this measure as reported
in the test manual is .88. Factor I contains
49 items and has a split-half reliability of .89 (55).
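
For readers unfamiliar with the split-half coefficient, the following sketch illustrates one common way such a figure is obtained: the items are divided into odd and even halves, the two half scores are correlated, and the correlation is stepped up with the Spearman-Brown formula. This is an assumption for illustration; the test manual's actual procedure is not described here, and the item responses shown are invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r, k=2):
    """Estimated reliability of a test k times as long as the part yielding r."""
    return k * r / (1 + (k - 1) * r)


def split_half_reliability(item_scores):
    """item_scores: one list of item scores (e.g. 0/1) per pupil."""
    odd_half = [sum(row[0::2]) for row in item_scores]   # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in item_scores]  # items 2, 4, 6, ...
    return spearman_brown(pearson(odd_half, even_half))


# Hypothetical responses of four pupils to a four-item scale (not study data).
toy_responses = [
    [1, 1, 1, 1],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [1, 0, 1, 1],
]
print(round(split_half_reliability(toy_responses), 2))
```

The step-up is needed because the correlation between halves describes a test only half as long as the one actually administered.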

At the time that the test battery for this research 
project was being assembled, the Objective-Analyt- 
ic Personality Test Batteries by R. B. Cattell and 
his associates had been available to psychologists 
for about three years. These measures, although 
in an experimental form, appeared to offer certain 
advantages over both the group administered per- 
sonality inventories and the projective techniques. 
The factors forming the basis of this battery emerged 
from a series of factor analysis studies by Cattell 
and a number of other researchers. This research 
resulted in the identification of 18 personality factors,
the first 12 of which Cattell considers suffi-
ciently invariant and clearly proved by research to 
be used with confidence (25). Fewer factors have 
been identified for children, and the factors that 
have emerged for children are based on fewer studies
than those for adults. Cattell has provided a
number of individual and group tests that can be used 
to measure each personality factor in his battery. 
Children’s tests often differ from the adult tests in 
level of sophistication and in some cases in factor 
loadings. 

Two factors, taken from the 12 Cattell considers
best established, were selected for use in this research.
The factors were UI-16, Competent Assertiveness,
and UI-24, Anxiety to Achieve. Cattell
advises the use of at least six tests to obtain each
factor score. As the research situation in the Utah
study did not permit the use of individual tests, six
group tests were selected for each of the two factors.
Tests selected for Factor UI-16, Competent Assertiveness,
were: G-6, G-20, G-27, G-42, G-44a, and
G-45. For Factor UI-24, Anxiety to Achieve, tests
G-26, G-29, G-31, G-38, G-41, and G-47 were selected.
These tests employ a wide variety of
approaches, but generally have no obvious relationship
to the factor being measured and therefore are
not fakable.

The California Psychological Inventory (CPI)
contains 480 items and yields scores on 18 personality
variables. The 18 scales are classified by the
test author into four broad areas or classes. Cross-
validational studies for each of the 18 scales of the
CPI are summarized in the test manual (49).

Reliability coefficients on the 18 scales generally
range around .70 and are adequate where comparisons
are to be made between groups
rather than individuals.

Concurrent validities on the 18 scales are reported
in a range from .26 to .83, with the majority
yielding correlations in the .30 to .50 range with
the concurrent criterion measures (49). The validity
evidence for the CPI is substantial, although
some of the validity coefficients are not as high as 
one might desire. When it is considered, however, 
that these coefficients are attenuated to a consider- 
able degree by the reliability of both the scales and 
criterion measures and in many cases represent 
correlations with criterion measures that would not 
be expected to be highly reliable, the reported valid- 
ities appear to be quite adequate for research pur- 
poses. Most of the concurrent validities are suffi- 
ciently high so that any marked influence of the 
grouping treatments upon these personality varia- 
bles should be revealed in the analysis. 
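
The attenuation argument above can be made concrete with the classical correction for attenuation, in which an observed validity coefficient is divided by the square root of the product of the two reliabilities. The sketch below is illustrative only: the observed validity of .40 and scale reliability of .70 are taken from the ranges quoted in the text, while the criterion reliability of .60 is a hypothetical value, since the source gives none.

```python
from math import sqrt


def disattenuate(r_xy, r_xx, r_yy):
    """Classical correction for attenuation.

    r_xy: observed correlation between scale and criterion
    r_xx: reliability of the scale
    r_yy: reliability of the criterion measure
    """
    if not (0 < r_xx <= 1 and 0 < r_yy <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / sqrt(r_xx * r_yy)


observed_validity = 0.40      # within the .30 to .50 range quoted in the text
scale_reliability = 0.70      # "around .70", as quoted in the text
criterion_reliability = 0.60  # HYPOTHETICAL; not reported in the source

corrected = disattenuate(observed_validity, scale_reliability, criterion_reliability)
print(round(corrected, 2))  # 0.62
```

On these figures a modest observed coefficient corresponds to a considerably stronger underlying relationship, which is the sense in which the reported validities may be regarded as adequate.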


The Projective Measures 


In the Utah study, the investigators wanted the 
spontaneous reactions of the children to pictures 
that portrayed situations relatively pertinent to the 
child and which were selected to reveal the variables 
to be measured in this research. 

There were many tests and studies that were 
more or less pertinent to the needs of this study, but 
none of the tests completely met the specific testing 
needs, nor did any of the scoring methods serve as 
a model to follow. The final decision of the investi- 
gators was to select appropriate pictures from the 
variations of the TAT and other apperceptive meas- 
ures available. 

Cards from the various measures were evaluated 
by the investigators and members of the Department 
of Psychology at Utah State University. After con- 
siderable discussion and comparison, nine cards 
from the Thematic Apperception Test (TAT) (87) and 
the Michigan Picture Test (MPT) (83) were selected 
for use in the pilot study. Scoring procedures based 
on a five-point rating scale independently applied by 
two raters were developed in the pilot study. Procedures
in the fifth-grade study called for individual
administration of the measure with the entire testing 
session tape recorded. Typed transcripts of the re- 
corded protocols were scored independently by the 
two raters. During the fifth-grade study, inter-rater
reliability coefficients were regularly calculated
on the blocks of protocols which had been
scored between scorer conferences. This served
as an objective check of the scorers’ adherence to
scoring rationale and procedures, and prevented
them from drifting apart in their procedures and
operational definitions. The composite rater reliabilities
for the 144 protocols scored in the fifth-
grade study were: aggression, .95; depression,
.86; and inferiority feelings, .90.¹⁵
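
The block-by-block check described above may be sketched as follows. The sketch assumes that agreement was indexed by the product-moment correlation between the two raters' five-point ratings and that the composite figure is the Spearman-Brown estimate for the pooled ratings of two raters; neither computation is specified in this section. The ratings, the function names, and the .85 floor used to flag a block are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length rating lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def composite_reliability(r_single, n_raters=2):
    """Spearman-Brown estimate for the pooled ratings of n_raters raters."""
    return n_raters * r_single / (1 + (n_raters - 1) * r_single)


def check_blocks(blocks, floor=0.85):
    """Return per-block inter-rater correlations and the indices of blocks
    falling below the floor (a HYPOTHETICAL cue for a scorer conference)."""
    correlations = [pearson(a, b) for a, b in blocks]
    flagged = [i for i, r in enumerate(correlations) if r < floor]
    return correlations, flagged


# Invented five-point ratings by two raters for two blocks of protocols.
blocks = [
    ([1, 2, 3, 4, 5, 3], [1, 2, 3, 4, 5, 3]),  # raters agree exactly
    ([1, 2, 3, 4, 5], [2, 1, 4, 3, 5]),        # raters have drifted apart
]
correlations, flagged = check_blocks(blocks)
for i, r in enumerate(correlations):
    print(f"block {i}: r = {r:.2f}, composite = {composite_reliability(r):.2f}")
print("blocks needing review:", flagged)
```

Computing the coefficient separately for each block, rather than once at the end, is what allows drift between scorers to be detected and corrected while scoring is still in progress.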


The sixth grade study, although building to a
large extent upon the experience gained in the fifth 
grade study, required several procedural changes 
to adapt to the group testing situation. One new 
stimulus picture was employed, and the total num- 
ber was reduced to seven. The pictures were pre- 
sented to groups of sixth grade pupils by means of 
an opaque projector in a semi-darkened room. Pu- 
pils were required to write seven-minute stories 
for each of the seven TAT-type pictures, instead of 
responding orally as in the fifth-grade study. Hav- 
ing pupils write stories to TAT pictures instead of 
recording their responses according to the standard 
procedure considerably shortened the length of the 
stories elicited, and offered the raters fewer clues 
upon which to base a score. 


The scoring method developed for the fifth grade
study was applied to the sixth grade sample of 338
TAT-type story protocols, with the same two investigators
scoring for both studies. The resulting inter-rater
reliability coefficients obtained for the
sixth grade sample were: aggression, .91; depression,
.70; and inferiority, .73. These are lower
than obtained in the fifth grade study, probably
because of the shorter protocols obtained from the
group administered test.


Sample 


Several personality tests and related measures 
were employed in the Utah study. The samples to 
whom the various measures were administered are 
shown in Table 11. It will be noted that most of
these measures were administered during the junior
high school years. The lack of suitable measures
appropriate for administration at the elementary
school level reduced the amount of data that could be
collected at this level. On the other hand, little
personality data were collected at the high school
level because of the limited differences between the
grouping treatments as applied at the high school
level in Districts A and R. As a rule, all subjects
in a given sample were administered the personality 
measures scheduled for use with that sample, ex- 
cepting the cases who were not available for either 
the regular or make-up testing. The projective 
measures, however, because of the expense of ad- 
ministration and scoring, were administered only 
to subgroups within Sample IV. 


RESULTS 


California Test of Personality 


Between-treatment comparisons on the Feeling 
of Belonging subscores of this measure showed Dis- 
trict R subgroups to be consistently higher than 
comparable District A subgroups. These differ- 
ences reached statistical significance in three of the 
six comparisons, one at each ability level. Com- 
parisons between pupils of different ability levels 
revealed a consistent tendency for pupils of higher 
ability to obtain more favorable scores on this sub- 
scale in both districts. These data suggest that pupils
generally experienced a greater feeling of belonging
in District R at all ability levels than in District
A and also seem to indicate that superior pupils
in both districts experienced a greater feeling of
belonging than pupils of lower ability level. In in- 
terpreting these findings, the reader should bear in 
mind that, as the California Test of Personality was 
administered to Sample IV only, the findings may be
accepted with somewhat less confidence than would 
have been the case if consistent results had been 
obtained for several samples. Also, the absence of 
concurrent validity evidence and the few items em- 
ployed to measure each of the subscale areas would 
suggest that these results be interpreted with con- 
siderable caution. 

Comparisons between District A and District R 
pupils on the Withdrawing Tendencies subscale re- 
vealed only one statistically significant difference 
favoring average boys in District R. The non-sig- 
nificant differences on this subscale were all in the 
direction of higher scores for District R pupils, thus 
presenting a rather consistent picture. 

Between-treatment comparisons on the Anti-so- 
cial Tendencies subscale revealed four significant 
differences in the six comparisons made, all favor-
ing the District R groups. District R boys had sig-
nificantly better scores at all three ability levels,
while girls were significantly different only at the 
average ability level. 


Inventory of Factors GAMIN 


Two of the personality factors from this inven- 
tory, Ascendancy and Inferiority, were hypothesized 
to be influenced by differences in the grouping treat-
ments. Between-treatment differences on the As-
cendancy score were generally small. Of the 18
comparisons made between pupils in District A and
District R on this factor, only two reached statisti- 
cal significance. Both of the significant differences 
were found in Sample IV, one favoring superior Dis- 
trict R boys and the other favoring slow District R 
girls. No consistent trends were apparent among 
the mean scores of comparable District A and 
R groups. Most of the differences between compa-
rable groups were very small, with 11 favoring Dis-
trict R groups and seven favoring District A groups.
The lack of any consistent pattern in the differences
obtained on this score, plus the fact that only two sig-
nificant differences were obtained from 18 compari-
sons, suggests that no true generalizable dif-
ferences in the Ascendancy score occurred as a re-
sult of differences between the two grouping situa-
tions.
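
The argument above, that two significant results among 18 comparisons is weak evidence of a real treatment effect, can be illustrated with a short calculation. The sketch below is not part of the original analysis. It assumes a .05 significance level (not stated in this section) and treats the 18 comparisons as independent, which is only approximately true because the same pupils enter several comparisons. Under those assumptions it gives the number of significant results expected by chance and the probability of obtaining two or more.

```python
from math import comb


def prob_at_least(k: int, n: int, alpha: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, alpha)."""
    below_k = sum(
        comb(n, i) * alpha**i * (1 - alpha) ** (n - i) for i in range(k)
    )
    return 1.0 - below_k


n_comparisons = 18   # from the text: comparisons on the Ascendancy score
n_significant = 2    # from the text: comparisons reaching significance
alpha = 0.05         # ASSUMPTION: significance level, not given in this section

# Number of "significant" results expected if no true difference exists.
expected_by_chance = n_comparisons * alpha

# Chance of seeing at least the observed number of significant results.
p_two_or_more = prob_at_least(n_significant, n_comparisons, alpha)

print(f"expected by chance: {expected_by_chance:.2f}")
print(f"P(at least {n_significant} of {n_comparisons}): {p_two_or_more:.3f}")
```

If the assumed level and independence hold, roughly one significant result would be expected by chance alone, so two such results do not by themselves indicate a generalizable difference.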

Between-treatment differences on the Inferiority
factor of the GAMIN were generally small. It was
hypothesized that superior District A pupils would 
receive less favorable scores on this variable than 
comparable District R pupils. Although a slight 
trend was evident in the hypothesized direction, on-
ly one of the six comparisons between scores of su-
perior pupils on this factor reached statistical sig-
nificance. It was also hypothesized that slow pupils
in District A would receive less favorable scores on 
this variable than their District R counterparts.
The results showed no consistent pattern of differ- 
ences favoring either grouping treatment for slow 
pupils on the Inferiority factor. These data would 
lead to the conclusion that feelings of inferiority as 
measured by GAMIN Factor I are not consistently 
different in ability grouped and random grouped
classrooms. 


OA Battery 


Analysis of data obtained on the Objective-An- 
alytic Personality Test battery required consider- 
able effort. Cattell (25) recommends that the var- 
ious tests going into a factor score be weighted 
equally, thus making it necessary to convert the 
raw scores on each test into standard scores and 
then add the standard scores for the subtests used 
in order to obtain a factor score. This procedure 
was followed to obtain a factor score for Factor UI- 
16, Competent Assertiveness. The performance of 
various Sample IV subgroups on this factor score 
was then compared, using analysis of variance. 
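
The equal-weighting procedure described above (convert each subtest's raw scores to standard scores, then add them) can be sketched as follows. This is a minimal illustration, not the study's actual computation: the subtest names and raw scores are hypothetical, and the use of the population standard deviation is an assumption, since the text does not say which form was used.

```python
from statistics import mean, pstdev


def standardize(raw_scores):
    """Convert one subtest's raw scores to standard (z) scores."""
    center = mean(raw_scores)
    spread = pstdev(raw_scores)  # ASSUMPTION: population SD
    if spread == 0:
        raise ValueError("all raw scores are identical; cannot standardize")
    return [(x - center) / spread for x in raw_scores]


def factor_scores(subtests):
    """Sum equally weighted standard scores across subtests.

    `subtests` maps a subtest name to a list of raw scores,
    one per pupil, with pupils in the same order in every list.
    """
    lengths = {len(scores) for scores in subtests.values()}
    if len(lengths) != 1:
        raise ValueError("every subtest needs one score per pupil")
    z_by_test = [standardize(scores) for scores in subtests.values()]
    # zip(*...) regroups the z scores pupil by pupil.
    return [sum(pupil_z) for pupil_z in zip(*z_by_test)]


# Hypothetical raw scores for three pupils on two subtests.
example_subtests = {"test_a": [1, 2, 3], "test_b": [10, 20, 30]}
example_factor = factor_scores(example_subtests)
print(example_factor)
```

Standardizing first keeps a subtest with a wide raw-score range from dominating the sum, which is the point of the equal-weighting recommendation.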

In general, comparisons between similar pupils 
in the two grouping treatments revealed very simi- 
lar means on this factor score. Only one of the six 
between-treatment comparisons revealed a statis- 
tically significant difference, District R slow girls 
obtaining a higher mean score (t = 3.90) than Dis- 
trict A slow girls. 

In order to gain some insights into the specific 
measures, the six tests selected to measure Cat- 
tell’s factor UI-24, Anxiety to Achieve, were ana- 
lyzed individually instead of combining them into a 
single factor score. Between-treatment compari-
sons on these six tests for superior pupils revealed
not a single statistically significant difference
among the 12 comparisons made. Similarly, com-
parisons for slow pupils failed to reveal a signifi-
cant difference. Only the comparisons between
average boys in the two districts revealed a consist-
ent pattern of significant difference. Of the six be-
tween-treatment comparisons made for this group,
four reached statistical significance, all favoring
the District R pupils. One significant difference on
Test G-47 was found in the between-treatment com-
parison of average girls, again favoring the District
R sample.

Interpretation of the results on the Cattell factors
is difficult, as would be expected from the explora- 
tory nature of the measures involved. The general 
lack of consistent between-treatment differences, 
however, suggests that the factors these tests meas- 
ure are not being differentially affected by the abil- 
ity grouping and random grouping treatments. 


California Psychological Inventory (CPI)


Between-treatment differences on the Class I
variables of the CPI were generally small and incon- 
sistent. The reader will recall that the six person- 
ality variables included in this class are broadly de- 
fined by the test author as measures of poise, as- 
cendancy, and self-assurance. Of the 24 compari- 
sons between superior pupils in the two districts 
three reached statistical significance, all favoring 
Sample VII superior boys in District A over compa-
rable District R pupils. As no significant differ-
ences occurred between superior pupils in Sample
VI, the Sample VII results must be considered ten-
tative. Comparisons between slow pupils in the two
districts revealed only two significant differences,
one favoring District R pupils and one favoring Dis- 
trict A pupils. 

Comparisons between average pupils revealed 
eight of the 24 between-treatment differences to be 
statistically significant. All eight of these differ- 
ences indicated higher scores for District R pupils. 
Five of these differences occurred in Sample VI and
three in Sample VII, suggesting some consistency in
the results. No significant differences on these var-
iables were hypothesized for average pupils and the
investigator has no explanation for the fairly consist-
ent tendency for District R pupils of average ability
to attain more favorable scores on the Class I CPI
personality variables.

Class III personality variables on the CPI include 
measures of achievement potential and intellectual 
efficiency. Three scores are obtained in this area: 
Achievement via conformance (Ac), Achievement via 
independence (Ai), Intellectual efficiency (Ie). It was
hypothesized that both superior and slow pupils would 
make higher scores on Achievement via conformance 
in District A. One of the four comparisons between 
superior pupils on this variable reached statistical 
significance in the hypothesized direction. No sig- 
nificant differences between slow pupils were found. 
It was hypothesized that District A superior pupils 
would receive higher scores on Intellectual efficien- 
cy. Although District A superior pupils obtained 
higher scores on all four of the comparisons on this 
variable, only one of the differences approached
significance.

It was hypothesized that superior District R pu- 
pils would obtain more favorable scores on Achieve- 
ment via independence. The results, however, 
showed that District A superior pupils made higher 
scores on this variable in all four of the compari- 
sons, one of which reached statistical significance. 
The only four between-treatment results that were 
reasonably consistent for the Class III variables were
between superior boys in Sample VII. District A pu-
pils in this sample were significantly higher in Ac
and Ai and also obtained higher scores, approach-
ing significance, on Ie.

Class IV of the CPI contains three scores class- 
ified by the test author as measures of intellectual 
and interest modes. It was hypothesized that Dis- 
trict R pupils at all ability levels would develop high- 
er scores on flexibility than comparable District A 
pupils. Two significant differences were found be- 
tween superior pupils on Flexibility. As one of these 
significant differences favors each treatment, they 
tend to cancel each other out, thus leading to rejec-
tion of the working hypothesis. No significant be-
tween-treatment differences emerged on the Flexi-
bility score for average and slow pupils.


The Projective Measures 


The fifth grade study involved comparing the
scores made by 71 District A slow pupils and 73 
District R slow pupils on aggression, depression 
and inferiority feelings. None of the between-treat- 
ment differences were statistically significant. 

In the sixth grade study, pupils at all three ability
levels were compared on the same three variables. Su-
perior and slow pupils in the random grouped situa-
tion were expected to obtain higher mean scores on
aggression and depression than similar pupils in
ability grouped classrooms, as an effect of frustra-
tions imposed by the school situation often consid-
ered to be geared primarily to the needs of the aver-
age pupil. This hypothesis was based on the as-
sumption, derived from Mathias’ (81) study, that
the wide spread of individual differences within the
heterogeneously grouped classroom would frustrate
both the bright and slow pupils and cause a greater
amount of aggression and depression in both groups.
The only finding in keeping with this hypothesis was 
that District R slow girls demonstrated significantly 
more depression than slow girls in District A. On 
aggression, the only significant difference obtained 
was in the direction opposite that hypothesized, Dis- 
trict A superior boys having scored significantly 
higher on this variable than comparable District R 
boys. The fact that the relationship is inverse, and
does not hold for the slow pupils, casts serious 
doubt upon Mathias’ interpretation of his findings. 
Mathias’ (81) and Luchins and Luchins (72) findings 
applied to both girls and boys as a group. However,
in the present findings a sex differential is present
that suggests that inferences concerning the effects
of a grouping practice upon any particular group of 
school children should consider the sex of the pupils 
as well as their level of academic achievement. 

Regarding inferiority feelings, it was hypothe- 
sized that ability grouped slow pupils would react to 
being classified as ‘‘dumb’’ by obtaining higher mean 
scores on this variable than similar heterogeneous-
ly grouped pupils. The findings of the sixth grade
study revealed no significant differences. Actually, 
both boys and girls received somewhat higher infe- 
riority scores in District R, although these differ- 
ences were well short of statistical significance. 
Thus, the claim that ability grouped slow pupils 
would react to the stigma of ‘‘being dumb’’, with in- 
creased feelings of inferiority, as advanced by Lu- 
chins and Luchins (72) was refuted. Data from the 
Utah study suggest that the method of grouping prob-
ably is not a significant factor in the development of
feelings of inferiority among elementary school chil- 
dren. It might be that inferiority is a very basic 
characteristic and not as easily influenced as other 
more ‘‘superficial’’ personality variables. On the
other hand, low ability pupils, grouped as such, may 
experience an increase in ego strength and feelings 
of adequacy due to less peer competition and great- 
er chances for success. Ability grouping may also 
orient the pupil’s parents to a better understanding 
of his potential and reduce parental expectation. It 
is possible that the aforementioned factors may off- 
set any undesirable effect that the ‘‘dumb’’ label may 
have upon the slow pupil. 

A final possibility, which applies whenever negative
results are found, is that the measures currently
available are not sufficiently sensitive to reveal dif- 
ferences that might have developed in the two treat- 
ments. 


SUMMARY AND CONCLUSIONS 


California Test of Personality (CTP) 


This measure was administered to Sample IV 
only. 

It was hypothesized that superior and slow pupils 
in ability grouped classes would obtain higher scores 
on the Feeling of Belonging subscale. Results 
showed District R subgroups to be consistently high-
er on this score at all ability levels. These data
suggest that ability grouping does not lead to a great-
er feeling of belonging on the part of pupils at any
ability level but instead provides a less favorable
climate than random grouping.

A lack of significant between-treatment differ- 
ences on the Withdrawing Tendencies subscale sug- 
gests that this characteristic was not influenced by 
the grouping treatments. 

Ability grouped pupils received less favorable
scores on Anti-Social Tendencies in all six between-
treatment comparisons, with four differences reach-
ing statistical significance. The results suggest
that ability grouped pupils, particularly boys, re-
port significantly more Anti-Social Tendencies than
do comparable pupils in random grouped class-
rooms. These differences were present at all three
ability levels.


Inventory of Factors GAMIN 


Two personality factors from this inventory, As- 
cendancy and Inferiority, were measured for pupils 
in Samples VI, VII, and VIII during the second year 
of the Utah study. These data, based on three sam-
ples, may be accepted with more confidence than
the CTP data discussed above. 

Of the 18 between-treatment comparisons made 
on the Ascendancy factor, only two reached statis- 
tical significance. These results suggest that the 
two treatments had no differential effect upon this 
personality variable. 

No consistent between-treatment differences 
emerged on the Inferiority factor of the GAMIN.
These results support the conclusion that ability
grouping is no more likely to develop inferiority
feelings in pupils at any ability level than is random
grouping. This is particularly noteworthy in the 
case of slow pupils because of the frequently stated 
opinion that ability grouping does develop inferior- 
ity feelings among such pupils. These data, based 
on three different samples at different grade levels, 
are more extensive than any previous evidence pub- 
lished concerning this question. 


Objective-Analytic Personality Test Factors (OA) 


Tests measuring two factors from this battery 
were administered to Sample IV and Sample VI. No 
consistent pattern of between-treatment differences 
emerged on either of these variables. We may con-
clude that Competent Assertiveness and Anxiety to
Achieve as measured on Cattell’s OA battery were
not affected differentially by the ability grouping and
random grouping treatments.


California Psychological Inventory (CPI) 


The CPI was administered to pupils in Samples
VI and VII. The 18 CPI variables are classified in-
to four broad areas. Class I variables include
measures of poise, ascendancy, and self-assurance.
Between-treatment comparisons on the Class I var-
iables showed no consistent superiority for either
treatment for superior pupils and slow pupils. Com-
parisons between average pupils revealed eight of
the 24 between-treatment differences to be statis-
tically significant, all favoring District R pupils.
From these data we may conclude that the two
grouping treatments do not appear to differentially
affect personality variables related to poise, as-
cendancy, and self-assurance, excepting at the aver-
age ability level where random grouped pupils
showed some tendency toward more favorable
scores.

Class III variables on the CPI include three
scores aimed at measuring Achievement via con-
formance (Ac), Achievement via independence (Ai),
and Intellectual efficiency (Ie). Between-treatment
differences on these three variables for subjects in
Samples VI and VII were generally non-significant.
There was some tendency for superior District A
boys to obtain more favorable scores than compara-
ble District R boys on these variables, with two of
the six comparisons between these groups reaching
statistical significance and the remaining four differ-
ences following this same trend. With the possible
exception of superior boys, it is concluded that the
grouping treatments had no influence upon the broad
area of achievement potential and intellectual effi-
ciency as measured on the CPI.

Class IV of the CPI is concerned with intellectual
and interest modes and is made up of three subtests:
Psychological-mindedness (Py), Flexibility (Fx),
and Femininity (Fe). No consistent between-treat-
ment differences emerged on any of these three
scores, leading us to the conclusion that ability group-
ing and random grouping have no differential effect
upon the Class IV CPI variables.


Projective Data 


The first phase of the projective research, re-
ferred to as the fifth grade study, measured aggres-
sion, depression, and inferiority feelings on an in-
dividually administered TAT-type projective meas-
ure using a sample of 144 slow pupils taken from
ability grouped and random grouped fifth grade
classes. The second phase, referred to as the
sixth grade study, employed a similar group-admin-
istered projective measure, using a sample of 338
slow, average and superior pupils taken from ability
grouped and random grouped sixth grade class-
rooms.

Between-treatment comparisons on aggression, 
depression and inferiority feelings in the fifth grade
study showed no statistically significant differences.
Between-treatment comparisons on these three per-
sonality variables in the sixth grade study
also failed to show consistent significant differences
between slow pupils.

From the projective data it may be concluded that 
ability grouping and random grouping do not differ- 
entially affect the scores of slow pupils on aggres- 
sion, depression and inferiority feelings. The lack 
of significant differences in inferiority feelings sup-
ports the data obtained on other samples using the in-
feriority subscore of the GAMIN. Taken together,
these results present strong evidence that ability
grouping as employed in the Utah study does not
cause the development of inferiority feelings among
slow pupils.

The projective data obtained in the sixth grade
study at all ability levels failed to support Mathias’ 
(81) contention that aggression and depression are 
higher in random grouped classes for superior and 
slow pupils. The few significant between-treatment
differences found on aggression and depression
were not supported by data obtained in the fifth
grade study. Thus, the most logical conclusion
would be that these variables are not influenced by
the ability grouping or random grouping treatments
employed in the Utah study.


A SUMMARY: CONCLUSIONS


THE GENERAL GOAL of this research has been 
to study differences in the effects upon elementary, 
junior high school, and high school pupils of an 
ability grouping system that differentiated the cur-
riculum principally by adjusting the rate of presen-
tation of curricular materials, and a random group-
ing system that differentiated the curriculum prin-
cipally through the use of enrichment. Two adja-
cent and closely comparable school districts in Utah
provided the setting in which these differences were 
explored. One of these districts employed random 
grouping with enrichment, and the other had adopted
a system of ability grouping with acceleration, co- 
incident with the start of this research. 

Over 2,500 pupils from the fourth, sixth, sev-
enth, eighth, and ninth grade levels were select-
ed in the two districts at the beginning of the study.
During the second year of the study, this sample 
was increased to about 4,000 pupils. Research da- 
ta were collected over a four-year period in order 
to appraise the long term effects of the two group- 
ing treatments. Pupils who were first tested early 
in grade four were followed through grade seven. 
As all samples were similarly followed over the pe- 
riod of the study, data were collected at all grade 
levels from four through twelve. The district that 
employed ability grouping is referred to throughout 
this report as District A. The other district par-
ticipating in this research, known as District R,
employed random or heterogeneous grouping. 

Three major types of differences were consid-
ered in analyzing most of the data. The first and
most important comparisons were between compa- 
rable pupils in the two grouping treatments. These 
comparisons involved pupils of the same sex and 
ability level in the ability grouped and random
grouped samples and are referred to as between- 
treatment differences. These differences provide 
the best evidence on the differential effects of abil- 
ity grouping and random grouping upon the depend- 
ent variables. Comparisons between pupils of the 
same sex and in the same grouping treatment who
differed in ability level are referred to as level or 
within-treatment differences. These comparisons 
tell us whether the grouping treatments had a differ-
ential effect upon pupils at different ability levels.
The final classification used in the analysis was sex. 
Boys and girls who were in the same grouping treat- 
ment and at the same ability level were compared to 
provide knowledge about the sex-related differences 
on the dependent variables. 


ACHIEVEMENT 


Achievement data were collected for all samples 
and gains from year to year as well as gains over 
the entire four years were analyzed. This proce- 
dure provided a large body of data on the nature of 
long term achievement in the two grouping treat- 
ments. Also, because achievement data for two or 
more samples were available at most grade levels, 
extensive cross checking of the results was possi- 
ble. During the first year of the study, the Califor- 
nia Achievement Test was used, while in subsequent 
years, measures of mathematics, science, social 
studies and reading from the STEP battery were 
administered. 


Elementary 


Achievement in four subject areas as well as ov- 
erall achievement were compared during the ele- 
mentary school years for these samples. A total of 
54 statistical comparisons based on analysis of var-
iance and covariance were made between ability
grouped and random grouped pupils. Of these 54 
comparisons, 28 revealed statistically significant
differences. Nineteen of these differences were fav- 
orable to District A pupils while nine favored Dis- 
trict R pupils. Of the 19 differences favoring ability 
grouped pupils, 15 occurred during the first year of 
the study. If these first year differences had been 
due to a substantial superiority of ability grouping 
over random grouping, we would have expected the 
differences to grow larger each year as the cumula-
tive effects of the better system widened the achieve-
ment gap. No such cumulative effects occurred and
in fact, many of the achievement differences favor- 
ing District A disappeared by the time Sample IV pu- 
pils had completed the sixth grade. Thus, we may 
conclude that neither ability grouping with accelera- 
tion nor random grouping with enrichment is more 
effective for all ability levels of elementary-school 
pupils. When data for the different ability levels 
were considered separately, achievement advan- 
tages of the two grouping systems, though small, 
tended to favor ability grouping for superior pupils 
and random grouping for slow pupils. As was hypoth- 
esized, the achievement results for average pupils 
did not consistently favor either grouping treatment. 


Junior High School


All five of the samples employed in this study
were in junior high school at some time during the
four years of the project. When we combine the 
achievement analyses for these five samples, we 
find that a total of 60 statistical comparisons were 
made between comparable groups in the two group- 
ing treatments; 33 in mathematics and 27 in science. 
Of the differences in mathematics achievement, five 
were significant in favor of District A subgroups, 
five in favor of District R, and the remaining 23 
were nonsignificant. Of the science comparisons,
five differences were significant favoring District A, 
one favoring District R and the remaining 21 were 
nonsignificant. 

A study of the results at each of the three ability 
levels indicates that there was some tendency for 
ability grouping to lead to greater mathematics 
achievement among superior pupils and greater sci-
ence achievement among average pupils. Among
slow pupils, the differences between the two group-
ing treatments were small, but tend to favor District
R in both mathematics and science. 


OVERACHIEVERS AND UNDERACHIEVERS 


In this phase of the Utah study, samples of fourth 
and sixth grade pupils from random grouped and 
ability grouped classrooms were classified as over- 
achievers, underachievers and normal achievers 
based on the relationship between their achievement, 
chronological age, and mental age. The proportions 
of overachievers, normal achievers and under- 
achievers among pupils of different ability levels in 
the two grouping systems were then compared. 

There was a significant tendency for girls more
frequently to be overachievers and less frequently to 
be underachievers than boys. This trend was pres- 
ent at both grade levels in both treatments. 

Overall comparisons between Districts A and R
indicated that there was a consistent trend for abil-
ity grouped pupils to be more often classified as over-
achievers and less often as underachievers. In com-
paring pupils at each of the three ability levels, we
find that differences between superior pupils in the
two districts were statistically significant at both
fourth and sixth grades, with District A having more
overachievers and a smaller percentage of under-
achievers than District R. No significant differ-
ences were found between the proportions of over-
achievers, underachievers, and normal achievers
among pupils of average or low ability in the two
districts.


STUDY METHODS AND ATTITUDES 


Considerable evidence concerning study habits 
was collected over the four years of the Utah study. 
Pupils in Samples IV, VI, VII, and IX were admin-
istered study habits measures at grade levels rang-
ing from seven to 12. Two measures were used,
the California Study Methods Survey (CSMS), which
yields three subtest scores, and the Survey of Study
Habits and Attitudes (SSHA), which yields a total score
only.

The Sample IV results on the CSMS showed all of 
the total score differences and seven of the nine sub- 
test differences between comparable groups in Dis- 
tricts A and R to be statistically significant. Differ- 
ences favored District R and were generally largest 
between pupils of superior ability. These results 
lead to the conclusion that pupils in a random
grouping situation consistently develop better study 
methods during the elementary school years than pu- 
pils in an ability grouping situation. 

Sample VI average pupils in District R received 
consistently higher scores on the CSMS than com- 
parable District A pupils. No significant between- 
treatment differences were present for slow or su- 
perior pupils. Sample VIII average pupils in Dis- 
trict R received higher SSHA scores. No signifi- 
cant between-treatment differences emerged from 
the Sample IX data. It may be concluded that if any 
advantage accrues from heterogeneous grouping at 
the secondary level, it is for average pupils, who
made significantly greater gains in study habits in 
Samples VI and VIII. 


SOCIOMETRIC CHOICE 


A near-sociometric measure of the usual partial-
rank-order type was developed for use in this study.
Three criteria were employed, each with five posi- 
tive choices. Each subject was asked to indicate by 
placing a check mark beside names on a roster of
his classmates, those whom he preferred under each
criterion. The three criteria were: 1) the five chil- 
dren in the class who are your best friends; 2) the 
five children with whom you prefer to study; and 3) 
the five children whom you would most like to have 
with you if you were transferred to another class- 
room. Each pupil’s sociometric status was obtained 
by summing the choices he received on the three cri-
teria. After obtaining a sociometric choice score
for each pupil, the pupils were classified as stars, 
regulars, neglectees, or isolates. 
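
The scoring step described above (summing the choices a pupil receives across the three criteria, then classifying) can be sketched as follows. This is an illustration only. The pupil names and ballots are hypothetical, the example uses fewer choices per criterion than the five allowed in the study, and the cutoffs `star_min` and `neglectee_max` are assumptions, since this section does not report the score ranges used to define stars, regulars, neglectees, and isolates.

```python
from collections import Counter


def choice_scores(ballots, roster):
    """Count the choices each pupil receives across all criteria.

    `ballots` maps each chooser to a dict of criterion -> chosen names.
    Self-choices and names not on the roster are ignored.
    """
    received = Counter({name: 0 for name in roster})
    for chooser, by_criterion in ballots.items():
        for chosen in by_criterion.values():
            for name in set(chosen):  # a name counts once per criterion
                if name != chooser and name in received:
                    received[name] += 1
    return dict(received)


def classify(score, star_min, neglectee_max):
    """Assign a status label. Both cutoffs are hypothetical."""
    if score == 0:
        return "isolate"
    if score <= neglectee_max:
        return "neglectee"
    if score >= star_min:
        return "star"
    return "regular"


# Hypothetical four-pupil class and ballots on the three criteria.
roster = ["Ann", "Ben", "Cal", "Dee"]
ballots = {
    "Ann": {"friends": ["Ben", "Cal"], "study": ["Ben"], "transfer": ["Ben", "Cal"]},
    "Ben": {"friends": ["Ann"], "study": ["Ann", "Cal"], "transfer": ["Ann"]},
    "Cal": {"friends": ["Ben"], "study": ["Ben"], "transfer": ["Ben", "Ann"]},
    "Dee": {"friends": ["Ben"], "study": ["Ann"], "transfer": ["Ben"]},
}

scores = choice_scores(ballots, roster)
status = {name: classify(s, star_min=7, neglectee_max=3) for name, s in scores.items()}
print(scores)
print(status)
```

In this toy class, a pupil chosen by no one is an isolate regardless of the cutoffs, while the other labels depend entirely on the assumed thresholds.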

Sample VI pupils were administered the socio-
metric measure only once, in the sixth grade,
and therefore this sample was used only in the
study of sociometric status patterns. Sample IV pu-
pils were measured in the fourth, fifth, and sixth
grades, and data from this sample were used to 
study sociometric status patterns, mobility of socio- 
metric status, and relationships between sociomet- 
ric status and other variables. 

The sociometric data collected seem to provide 
considerable evidence relating to sociometric choice 
in ability grouped and random grouped classes dur-
ing the intermediate grades. Analysis revealed that
the overall proportions of stars, regulars, and neg-
lectees in District A and District R were similar,
thus leading to the conclusion that ability grouping
did not result in a permanent leadership vacuum in
groups of average and slow pupils.

Superior students were found to lose some socio-
metric status when placed in ability grouped class-
rooms. This loss was particularly evident in the
star classification but was not accompanied by any 
increase in the neglectee-isolate classification. Da- 
ta on mobility of sociometric status suggest that su- 
perior pupils regain some of this status loss after 
they have made an adjustment to the ability grouped 
situation. Average and slow pupils appear to have 
a far better chance of gaining social recognition in
ability grouped classrooms than do comparable pu- 
pils in random grouped classrooms. For the slow 
pupil, ability grouping not only appears to increase 
the pupil’s chances of being classified as a star, but
also reduces his chances of being classified as 
a neglectee-isolate. Sociometric status mobility 
patterns between Districts A and R demonstrate a
greater number of status changes in ability grouped
classrooms. The trend of these changes is general-
ly toward higher sociometric status, further sup-
porting the conclusion that organization of pupils in-
to ability grouped classes brings about marked
changes in sociometric status patterns during the 
first one or two years and generally leads to per- 
manently higher status patterns for average and slow 
groups. 

In random grouped classrooms, the pupil’s abil- 
ity level appears to be an important factor in deter- 
mining his sociometric status. Sociometric status 
patterns differ considerably for pupils of different 
ability levels in random grouped classrooms, with 
pupils of higher ability consistently obtaining more 
favorable mean scores. The presence of significant 
correlations obtained when sociometric status was 
related to ability and achievement for District R pu- 
pils further demonstrates the importance of ability 
to status in the random grouped class. On the other
hand, ability does not appear to be an important factor
in determining sociometric status in ability
grouped classrooms. In the Utah study, sociomet- 
ric status patterns for pupils of different ability lev- 
els in ability grouped classrooms were closely com- 
parable, showing no evidence of real differences. 
The absence of correlations between ability and status
suggests that these factors are far less important
to friendship choices in ability grouped classrooms.

We were also interested in learning what hap- 
pened to a pupil’s social status in the two grouping 
systems when his ability level changed. In District 
A such a change meant a change in classroom as- 
signment, while in District R the pupil would be un- 
aware of the change. First let us consider pupils 
who received a lower ability level classification in 
a succeeding year. District A pupils in this classification
experienced significantly more upward mobility
than was experienced by the comparable District R
pupils. Thus, the downward movement
seemed to increase the ability grouped pupil’s rela- 
tive social status as he was recognized as having 
come from a higher ability group and was near the 
top of his new group whereas he had been near the
bottom of his previous group. These data would 
suggest that any damage to the ability grouped pu- 
pil’s self-concept that might occur because of his 
reclassification to a lower ability section might be 
compensated to some extent by gains in social status 
he is likely to experience in his new class. 

Pupils in District A and District R who moved 
upward in ability level classification were also studied.
Two of the three chi-square comparisons
made between these pupils yielded statistically significant
differences. In each instance, pupils in
random grouped classrooms showed more favorable 
sociometric status changes than comparable pupils 
in District A classrooms. These findings suggest 
that although ability grouping might initially provide 
greater opportunities for social leadership for those
of average and low ability, it also might make more 
difficult the task of maintaining or gaining social 
recognition if the pupil makes achievement gains 
sufficient to raise his ability level classification. 
Loss of social status that is likely to accompany his 
reassignment to a new classroom might outweigh 
any gain in status he may experience from being 
identified as a member of the brighter group. 
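
The chi-square comparisons reported above can be illustrated with a short sketch. The counts below are hypothetical, chosen only to show the mechanics; they are not data from the Utah study. The function name, the three-way tabulation (status rose, unchanged, fell), and the use of the .05 level are likewise assumptions made for this example.

```python
def chi_square(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if status change were independent of district.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# HYPOTHETICAL counts of pupils whose sociometric status rose, stayed the
# same, or fell after reclassification (not taken from the study).
status_changes = [
    [12, 30, 18],  # District A (ability grouped)
    [24, 28, 8],   # District R (random grouped)
]

statistic, df = chi_square(status_changes)
# The .05 critical value of chi-square with 2 degrees of freedom is 5.991.
print(f"chi-square = {statistic:.2f}, df = {df}, beyond .05 level: {statistic > 5.991}")
```

A statistic beyond the critical value would indicate that the pattern of status changes differs between the two districts by more than chance alone would be expected to produce.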

The lack of any consistent significant differences 
between boys and girls suggests that the effects of 
ability grouping and random grouping on sociometric 
variables are about the same for the two sexes. 

Low but significant correlations were found to ex- 
ist between sociometric status, pupil attitude, and 
self-concept variables. These correlations, though 
low, demonstrate the presence of relationships be- 
tween the pupil’s social acceptance and his percep- 
tion of himself, his school environment, and his 
peers. 


PUPIL ATTITUDES 


Pupils in Sample IV were administered the USU 
School Inventory during their sixth grade year. This
measure contains three scales: Attitude Toward 
Peers, Attitude Toward the Teacher, and Attitude 
Toward School. 

Results on the Attitude Toward Peers scale re- 
vealed no significant between-treatment differences. 
Comparison on the Attitude Toward the Teacher
scale showed that superior girls and boys and slow
boys in District A received significantly more favorable
scores than comparable District R pupils. No
significant differences were found between average 
pupils in the two districts. Thus, ability grouping 
appears to be associated with more favorable atti- 
tudes toward the teacher among both superior and 
slow pupils. 

Comparisons between the two treatments on the
Attitude Toward School scale showed one significant
difference favoring slow boys in District A. Within
both treatments, there were significant differences
between pupils of different ability levels, all indicative
of more favorable attitudes for pupils of higher
ability. Three of the six sex differences on the Attitude
Toward School scale were statistically significant.
These differences, all of which favored girls,
were largest at the superior level.


PUPIL PROBLEMS 


The instruments employed to study pupil problems
were the SRA Youth Inventory and the SRA Junior
Inventory. These measures are essentially
check lists each of which classifies the problems 
listed into a number of areas. A study of the prob- 
lems listed on the two inventories revealed several 
areas in which one of the grouping treatments employed
in the Utah study might be expected to differ
from the other. 

The SRA Junior Inventory, Form S, was administered
to Sample IV pupils in Districts A and R during
their fifth grade year. The SRA Youth Inventory,
Form A, was administered to Sample VI pupils during
their seventh grade year, Sample VII pupils during
their eighth grade year, and Sample VIII pupils during
their ninth grade year. As the SRA Youth Inventory
was administered to three different samples at
the junior high school level, the results on this
measure provide fairly reliable evidence.

In reviewing the between-treatment differences 
on the problem areas of the Youth Inventory, per- 
haps the most noteworthy trend is the preponderance 
of significant differences indicating larger mean 
problem scores for the District R groups. A total
of 32 between-treatment differences reached or approached
statistical significance for the junior high
school samples. These differences were found in
all samples. Of these 32 differences, only one in- 
dicated more favorable problem scores for District 
R pupils while the other 31 indicated more favorable 
problem scores for District A. 

The frequency of significant differences between 
the districts was also found to be related to ability 
level. Of the 32 aforementioned differences on the 
SRA Youth Inventory, 15 were found between superior
pupils in the two districts, and in 14 of these,
the District R pupils obtained higher (less favorable) 
scores. Eleven of the 32 differences were between 
average pupils in the two districts, all indicating
higher problem scores for District R groups. These
results, reflecting as they do a consistently high
number of problems for superior and average Dis- 
trict R groups, would suggest the presence of true 
treatment differences at these ability levels. This 
trend, although present for slow pupils, was far less 
pronounced, and less general from sample to sam- 
ple. These data support the overall conclusion that 
ability grouped junior high school pupils in average 
or superior groups report fewer problems than do
comparable pupils in random grouped classes.
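
The tallies in the two preceding paragraphs can be cross-checked with a little arithmetic. The sketch below uses only the counts stated in the text; the variable names are illustrative, and the slow-pupil figures are derived here by subtraction rather than reported directly.

```python
# Counts stated in the text for the SRA Youth Inventory (junior high samples).
total_differences = 32
favoring_district_a = 31  # all but one of the 32 differences
superior = {"total": 15, "favoring_a": 14}
average = {"total": 11, "favoring_a": 11}

# Slow-pupil counts are not stated directly; they follow by subtraction.
slow_total = total_differences - superior["total"] - average["total"]
slow_favoring_a = favoring_district_a - superior["favoring_a"] - average["favoring_a"]

print(f"slow pupils: {slow_total} differences, {slow_favoring_a} favoring District A")
for label, count in [("superior", superior["total"]),
                     ("average", average["total"]),
                     ("slow", slow_total)]:
    print(f"{label:>8}: {count / total_differences:.0%} of the 32 differences")
```

On these figures the slow groups account for 6 of the 32 differences, against 15 and 11 for the superior and average groups, which is consistent with the statement that the trend was far less pronounced for slow pupils.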


SELF-CONCEPT


The principal measure of self-concept employed 
in this research was the Index of Adjustment and 
Values, developed by Robert E. Bills (11). Pupils 
in all five samples were administered the IAV. Samples
IV and VI were each tested three times over a
three-year period, while Samples VII, VIII, and IX
were each tested twice over a two-year period. In
the IAV the pupil answers three questions about himself:
he rates himself with respect to each of the
traits listed (concept-of-self), tells how he feels
about being this sort of person (acceptance-of-self),
and tells how he would like to be with respect to
each trait (ideal-self). In addition to these
three scores, a fourth score, Discrepancy, was obtained
by subtracting the Concept of Self score from
the Ideal Self score.
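
A minimal sketch of the Discrepancy computation just described follows. The trait ratings are hypothetical and the function name is an assumption for illustration; only the subtraction itself (Ideal Self minus Concept of Self) comes from the text, and applying it to summed totals is a further assumption.

```python
def discrepancy_score(concept_of_self, ideal_self):
    """Ideal Self total minus Concept of Self total, as described in the text."""
    if len(concept_of_self) != len(ideal_self):
        raise ValueError("both rating lists must cover the same traits")
    return sum(ideal_self) - sum(concept_of_self)


# HYPOTHETICAL ratings for five traits (illustrative values only).
concept_of_self = [3, 4, 2, 5, 3]
ideal_self = [5, 4, 4, 5, 4]

print(discrepancy_score(concept_of_self, ideal_self))
```

A larger value means the pupil's ideal lies further from his description of himself, which is why higher Discrepancy means are treated as less favorable in the comparisons that follow.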

For each of the four IAV variables, 66 between- 
treatment comparisons were made, 22 at each abil- 
ity level. 

Of the 66 comparisons on the Concept of Self
score, 23 reached or approached statistical signifi- 
cance. All but two of these 23 differences were in 
a direction favoring District R pupils. Eight were 
between superior pupils in the two districts, ten be- 
tween average pupils, and five between slow pupils. 
Fifteen of these 23 differences were between girls’ 
groups while only seven were between boys’ groups. 
The extent and consistency of differences in the Con- 
cept of Self data seem to justify the conclusion that 
at all ability levels, pupils in random grouped class- 
rooms have more favorable concepts of self than 
comparable pupils in ability grouped classrooms. 
With respect to concept of self, the two grouping 
treatments had a somewhat greater effect upon girls
than boys. 

Of the 66 between-treatment comparisons of the 
IAV Acceptance of Self mean scores, 15 were statistically
significant and seven others approached significance.
Six of the significant differences were
between superior pupils, six between average pupils,
and three between slow pupils in the two districts.
All 15 of these significant between-treatment
differences favored District R, as did four of
the seven differences that approached significance.
Based on these data, it must be concluded that ran- 
dom grouping is consistently related to higher self 
acceptance for pupils at all ability levels and over 
most of the grade levels covered in this project. 

The overall summary of between-treatment comparisons
on the IAV Discrepancy Score shows 14 of
the 66 differences to be statistically significant. Six
were between superior pupils in the two districts
and six were between average pupils.
All but one of the 14 significant differences showed
higher (less favorable) Discrepancy Score means for
District A subgroups.
It may be concluded that the two grouping treatments
have a differential effect upon the pupils’ Discrepancy
Scores, with ability grouping generally associated
with less favorable scores. This pattern
is most pronounced in the differences between the
mean scores of superior pupils in the two treatments,
closely followed by average pupils. The differences 
between slow pupils are substantially smaller. 

Of the 66 between-treatment comparisons of mean 
Ideal Self scores on the Index of Adjustment and Val- 
ues, eight reached statistical significance. Seven 
of these eight differences favored District R pupils, 
with two occurring between superior groups, four 
between average groups, and two between slow
groups. The relatively small number of significant
results suggests that the grouping system has little influence
upon the Ideal Self score. 
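
The counts reported for the four IAV scores can be set side by side to show why the Ideal Self results are read as weak. The sketch below uses only figures given in the text; the names are illustrative. For Concept of Self the text combines differences that reached or approached significance, so that figure is not strictly comparable with the other three.

```python
COMPARISONS_PER_SCORE = 66  # 22 comparisons at each of three ability levels

# Number of between-treatment differences reported as significant.
# Concept of Self counts differences that "reached or approached" significance.
significant = {
    "Concept of Self": 23,
    "Acceptance of Self": 15,
    "Discrepancy": 14,
    "Ideal Self": 8,
}

for score, count in significant.items():
    share = count / COMPARISONS_PER_SCORE
    print(f"{score:<20}{count:>3} of {COMPARISONS_PER_SCORE}  ({share:.0%})")
```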

Sample IV pupils in District A who changed ability
levels between the fifth and sixth grade administrations
of the IAV were compared to determine
whether changes in ability classification would be
accompanied by changes in self-concept scores. The
results indicated that remaining in the same ability
classification in an ability grouping system appears
to be less threatening to the pupil’s self-concept
than reclassification to either a higher or lower section.
Placement of a pupil in a lower ability classification
appears to bring about lower scores in Concept
of Self, Acceptance of Self, and Ideal Self. Pupils
destined to be reassigned to a lower ability
classification were found to have lower scores in
Concept of Self and Acceptance of Self prior to reassignment.
This lower initial score may indicate
that misassignment of a pupil to a higher group than
his ability would justify brings about a lowering of
his Concept of Self and Acceptance of Self because
of his inability to perform satisfactorily in his assigned
group.


PERSONALITY 


Variables from four personality inventories were
employed: the California Test of Personality
(CTP); the Inventory of Factors GAMIN (GAMIN); the
Objective-Analytic Personality Tests (OA); and the
California Psychological Inventory (CPI). A projective
measure employing cards from the Thematic Apper- 
ception Test (TAT) and the Michigan Picture Test 
(MPT) was also used to measure three variables 
that seemed particularly important. 


California Test of Personality (CTP) 


This measure was administered to Sample IV on- 
ly. It was hypothesized that superior and slow pu- 
pils in ability grouped classes would obtain higher 
scores on the Feeling of Belonging subscale. Results
showed District R subgroups to be consistently
higher on this score at all ability levels. These data
suggest that ability grouping does not lead to a
greater feeling of belonging on the part of pupils at
any ability level, but instead provides a less favor- 
able climate than random grouping. 

Ability grouped pupils also received less favora- 
ble scores on Anti-social Tendencies in all six be- 
tween-treatment comparisons, with four differences 
reaching statistical significance. 


Inventory of Factors GAMIN 


Two personality factors from this inventory, As- 
cendancy and Inferiority, were measured for pupils 
in Samples VI, VII, and VIII during the second year 
of the Utah study. Of the 18 between-treatment 
comparisons made on the Ascendancy factor, only 
two reached statistical significance. These results 
suggest that the two treatments had no differential 
effect upon this personality variable. No consistent 
between-treatment differences emerged on the Inferiority
factor of the GAMIN. Our findings support
the conclusion that ability grouping is no more like- 
ly to develop inferiority feelings in pupils at any 
ability level than is random grouping. 


Objective-Analytic Personality Test Factors (OA)


Tests measuring two factors from this battery 
were administered to Sample IV and Sample VI. No 
consistent pattern of between-treatment differences 
emerged for either of these variables. We may conclude
that neither Competent Assertiveness nor Anxiety
to Achieve, as measured on Cattell’s OA battery,
was affected differentially by the two grouping treatments.


California Psychological Inventory (CPI) 


The CPI was administered to pupils in Samples 
VI and VII. The 18 CPI variables are classified by
the test author into four broad areas. Class I vari- 
ables include measures of poise, ascendancy, and 
self-assurance. Between-treatment comparisons on 
the Class I variables showed no consistent superior- 
ity for either treatment for superior pupils and slow 
pupils. Average pupils received more favorable 
scores in random grouped classes. 

Class III variables on the CPI include three
scores aimed at measuring Achievement via conform- 
ance (Ac), Achievement via independence (Ai) and 
Intellectual efficiency (Ie). Between-treatment dif- 
ferences on these three variables for subjects in 
Samples VI and VII were generally nonsignificant al- 
though there was some tendency for superior pupils 
in District A to obtain more favorable scores. 

Class IV of the CPI is concerned with intellectual
and interest modes and is made up of three subtests:
Psychological-mindedness (Py), Flexibility (Fx),
and Femininity (Fe). No consistent between-treat- 
ment differences emerged on any of these three 
scores. 


Projective Data


The first phase of the projective research, referred
to as the fifth grade study, measured aggression,
depression, and inferiority feelings on an
individually administered TAT-type projective
measure using a sample of 144 slow pupils taken
from ability grouped and random grouped fifth grade
classes. The second phase, referred to as the
sixth grade study, employed a similar group-administered
projective measure, using a sample of 338
slow, average, and superior pupils taken from ability
grouped and random grouped sixth grade classrooms.

Between-treatment comparisons on aggression,
depression, and inferiority feelings in the fifth
grade study showed no statistically significant differences.
Between-treatment differences on these
three personality variables obtained in the sixth
grade study also failed to show consistent significant
differences.

The lack of significant differences in inferiority
feelings supports the data obtained on other samples
using the inferiority subscore on the GAMIN. Taken 
together, these results present considerable evi- 
dence that ability grouping as employed in the Utah 
study does not cause the development of inferiority 
feelings among slow pupils. 


OVERALL APPRAISAL OF THE GROUPING 
TREATMENTS 


Let us now attempt to weigh the overall effects 
of the two grouping treatments on superior, average, 
and slow pupils at the elementary and junior high 
level. Table 20 provides a brief summary of these 
effects. Because of the limitations of the high 
school data collected in the Utah study, such an ov- 
erall appraisal does not appear appropriate at that 
level. Any such appraisal of the two grouping treat- 
ments depends to a considerable degree upon the 
value system of the person making the appraisal. In 
other words, different persons would assign differ- 
ent amounts of importance to the various types of 
behavior and experience that were appraised by the 
measures in the Utah study. The following evalua- 
tion, therefore, reflects the value system of the in- 
vestigator. Educators having different orientations 
may well draw different overall conclusions from 
the findings of this research. 


Elementary 


Superior Pupils: The superior pupil generally 
showed greater achievement gains in ability grouped 
classes. The differences in the Utah study were not 
large except for the first year, but data for the four
years reflected a significant overall advantage for
the ability grouping system. The data on overachievers
supported this finding, but as this phase
of the research was carried out for only one year,
the results must be considered tentative.

The study methods data for superior pupils
showed a consistent and fairly large advantage for 
the random grouping treatment. Thus, although the 
ability grouped bright pupil may achieve more, he 
may develop less adequate study methods and be less 
capable in an independent study situation than his
counterpart in the heterogeneous classroom. 

When we consider the non-cognitive data for su- 
perior elementary school pupils, perhaps the most 
noteworthy finding is the loss in sociometric status 
and self-concept found for these pupils in the ability 
grouped situation. Some investigators have suggested
that bright pupils have a somewhat inflated opinion
of their own capabilities. If this is the case,
perhaps the lower scores obtained on these meas- 
ures by superior pupils in the ability grouping situ- 
ation reflect a more healthy attitude than is found in 
the random grouped classroom. In elementary 
school classrooms, the teacher exerts considerable 
influence upon the peer group situation, and it may 
be that the teacher’s tendency to place a very high 
value on achievement results in high achieving pupils
receiving a disproportionate amount of recognition.
It will be recalled that the sociometric choice
data, although indicating a reduction in the number
of superior pupils of star status in the ability grouped
classroom, did not show an increase in the number of
neglectee-isolates for this group. In weighing the total
treatment effects for superior pupils during the
intermediate grades in the elementary school, the 
investigator concludes that the ability grouping treat- 
ment is slightly more desirable. 

Average Pupils: In reviewing the results of the 
Utah study for average pupils during the fourth, fifth, 
and sixth grades of the elementary school, we find 
a complex pattern of advantages and disadvantages 
associated with the two grouping treatments. In 
terms of achievement, it seems there is nothing to 
choose between the two treatments for average pu- 
pils. The average pupils, however, were found to 
have better study methods in the random grouped
treatment. 

With regard to non-cognitive variables, we find 
that average pupils showed more favorable person- 
ality characteristics, higher self-concept scores, 
and fewer pupil problems in the random grouped sit- 
uation. The only aspect of their experience that 
strongly favored the ability grouping situation was 
the improved sociometric status they gained in this 
treatment. Although this improvement in status is 
not to be taken lightly, it seems that the preponder- 
ance of other variables favoring the random grouped 
situation must lead to the conclusion that the needs 
of most average pupils were better met in the heter- 
ogeneous classroom. 

Slow Pupils: In terms of achievement-related variables,
the slow pupil generally showed better performance
in the heterogeneous classroom. His

TABLE 20

SUMMARY OF COMPARISONS BETWEEN THE TWO GROUPING TREATMENTS

                                            Elementary        Junior High
Variable                  Sample            Sup.  Aver. Slow  Sup.  Aver. Slow  Remarks

Achievement               All               A*    N     R     A     A     R     Achievement differences over the
                                                                                four years were not large.
Overachievers and         IV, VI            A     N     N                       Based on first year’s data only.
  Underachievers
Study Methods and         IV, VI,           R     R     R     N     A     N     Elementary level differences
  Attitudes               VUl,                                                  largest for superior pupils.
Sociometric Choice        IV, VI            R     A     A                       In ability grouped classrooms,
                                                                                gains of average and slow pupils
                                                                                are large.
Pupil Attitudes           IV
  Toward Peers                              N     N     N                       Slow boys had notably better
  Toward Teacher                            A     N     A                       attitudes in ability grouped
  Toward School                             N     N     A                       classes.
Pupil Problems            IV, VI,           N     R     N     A     A     A     At junior high level, ability grouping
                          VII, VIII                                             was consistently related to
                                                                                fewer problems in all replications.
Self Concept Scores       All
  Concept of Self                           R     R     R     R     R     R     Ability grouping was generally
  Acceptance of Self                        R     R     R     R     R     R     associated with less favorable
  Ideal Self                                N     R     N     N     N     N     self-concept scores at all levels
  Discrepancy                               R     R     R     R     R     R     and for all samples.
Personality
  1. Feeling of           IV                R     R     R                       Personality measures at elementary
     Belonging                                                                  level of doubtful validity.
  2. Anti-Social          IV                R     R     R
     Tendencies
  3. Ascendancy           VI, VII,                            N     N     N     The few significant differences on
                          VIII                                                  3, 4, and 5 generally favored District
                                                                                R, but were not consistent
                                                                                between replications.
  4. Inferiority          VI, VII,                            N     N     N
                          VIII
  5. Competent            IV                N     N     R
     Assertiveness
  6. Anxiety to           VI                                  N     R     N
     Achieve
  7. CPI Class 1          VI, VII                             A     R     N     Differences between average pupils
                                                                                were more consistent than those
                                                                                between superior pupils.
  8. CPI Class 3          VI, VII                             A     N     R
  9. Flexibility          VI, VII                             N     N     N
 10. Projective test
     Aggression           IV                N     N     N
     Depression           IV                N     N     N
     Inferiority          IV                N     N     N

* A indicates results favoring ability grouping.
  R indicates results favoring random grouping.
  N indicates results favoring neither consistently.

achievement gains were rather consistently higher
in the random grouped situation as were his study
habits scores. 

Review of the non-cognitive variables for slow pupils
does not present a clear-cut advantage for either
grouping treatment. The most striking result
of ability grouping for slow pupils was the tremen- 
dous gains they made in sociometric status. The 
slow pupil’s attitudes toward both the school and 
teacher were also somewhat more favorable in the 
ability grouping situation. On the other hand, the 
self-concept data as well as the rather limited per- 
sonality data available at this level tended to favor 
the random grouping. The reader may recall, how- 
ever, that the differences between the two treat- 
ments in self-concept scores were much smaller for 
slow pupils than for superior or average pupils. 
Thus, it is the conclusion of the investigator that the 
large gains in sociometric choice are the most sig- 
nificant single effect of the grouping treatments on 
slow pupils. Therefore, it is concluded that the 
ability grouping system provides a more favorable 
environment for the slow pupil than does the random 
grouping treatment. 


Junior High School 


In reviewing Table 20, the reader will note that 
data on overachievers and underachievers, socio- 
metric choice, and pupil attitudes were not collected 
at the junior high school level. Somewhat more per- 
sonality data, however, were collected at this level, 
and these data can be accepted with somewhat more 
confidence than the personality data obtained during 
the elementary school years. 

Superior Pupils: Ability grouping led to significantly
greater achievement gains for superior pupils
at the junior high school level, although these differences
were not large. On the non-cognitive
measures, superior pupils in the ability grouped
classes reported fewer pupil problems and obtained
more favorable scores on the CPI measures of
poise, ascendancy, and self-assurance (Class I), as
well as on measures of achievement potential and
intellectual efficiency (Class III). These differences
on the CPI, however, were found in fewer than one-half
of the replications, and may be regarded as tentative.
When all data are considered, it appears to
the investigator that ability grouping is slightly more
advantageous for superior pupils at the junior high
school level.


Average Pupils: The pattern is somewhat the 
same for average groups, with ability grouped pupils
making greater achievement gains and obtaining more
favorable study methods scores. On the non-cognitive
variables, average pupils in ability grouped classrooms
reported fewer problems, but obtained somewhat
less favorable self-concept scores than similar
groups in the random grouped classrooms. Random 
grouped average pupils also obtained generally more 
favorable scores on the CPI Class I variables as 
well as on the Anxiety to Achieve measures on the 
Cattell OA battery. Again there seems little to 
choose between the two grouping treatments although 
the differences in the cognitive variables seem to 
give ability grouping a slight advantage. 


Slow Pupils: Slow pupils in random grouped
classrooms at the junior high school level general- 
ly achieved more than their ability grouped counter- 
parts. Random grouped slow pupils also received 
somewhat more favorable self-concept scores al- 
though they reported a greater number of problems 
than the ability grouped samples. The personality 
data for slow pupils were characterized by a complete
lack of consistent differences favoring either treat- 
ment. The lack of significant differences on aggres- 
sion, depression, and inferiority feelings found in 
the projective phase of the personality study leads 
us to question some of the dire consequences that 
have been predicted by critics of ability grouping. 
Although personality data at this age level are at 
best tentative, the inferiority feelings and other neg- 
ative personality characteristics thought to be 
caused by ability grouping of slow pupils did not 
emerge in the Utah study. In considering the over- 
all pattern of differences, however, the investigator 
concludes that the heterogeneous classroom provides 
a slightly better environment for slow pupils at the
junior high school level. 


FOOTNOTES


The research reported herein was supported by the Cooperative Research 
Program of the Office of Education, U. S. Department of Health, Educa- 
tion, and Welfare. 


Most of the analysis of within-treatment and sex differences has not been
included in this monograph, but may be found in the final research report
prepared for the U. S. Office of Education: An Evaluation of Ability
Grouping, Cooperative Research Project 577, Logan, Utah: Utah State University,
1964. All aspects of the study are covered in a great deal more
detail in this final report and the reader who requires such information is 
urged to refer to this report. All major findings related to the relative 
merits of the grouping treatments, however, are contained in this mono- 
graph. 


A set of statistical hypotheses in the null form was developed as part of
the original research plan.


For slow pupils, t = 2.09; for average, t = 2.44; for superior, t = 2.07.


A portion of this phase of the Utah Study is based on Dr. Nephi Manning’s 
dissertation, which was completed under Dr. Borg’s supervision. 


Quoted from a personal communication from Dr. Miriam Goldberg, dated 
September 27, 1963, with the author’s permission. 


Quoted from a personal communication from Dr. Miriam Goldberg, dated 
September 27, 1963, with the author’s permission. 


Parts of this chapter are based on Dr. David Brinks’ doctoral research 
carried out under Dr. Borg’s supervision. 


The three sociometric classifications used in this project were star, reg- 
ular, and neglectee-isolate. 


Portions of this section are based upon Mr. Val R. Christenson’s Master’s 
thesis which was carried out under Dr. Borg’s supervision. 


Data on these areas, as well as data on ability level differences, and sex 
differences may be found in the Office of Education, final report CRP577. 
These data were omitted from this monograph in order to conserve space. 


Three other self-concept measures were also used in the Utah study: re- 
sults may be found in the USOE Final Report, CRP577. 


The Discrepancy Score was calculated by a different method in the Utah 
study than recommended by Bills. 


The projective data are based upon the doctoral dissertations of Dr. Luna 
Brite and Dr. Clarence Swaner that were carried out under Dr. Borg’s 
supervision.


Corrected using the Spearman-Brown prophecy formula.
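
For reference, the Spearman-Brown correction mentioned in the last footnote can be sketched as follows. This is the standard textbook formula, not code or figures from the study; the function name is an assumption, and the example coefficient is hypothetical.

```python
def spearman_brown(r, k=2.0):
    """Predicted reliability of a test lengthened k times, given reliability r.

    With k=2 this is the usual step-up from a split-half correlation
    to the reliability of the full-length test.
    """
    return k * r / (1.0 + (k - 1.0) * r)


# HYPOTHETICAL split-half correlation, for illustration only.
half_test_r = 0.60
print(round(spearman_brown(half_test_r), 2))
```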